OpenVINO quant of agentica-org/DeepCoder-14B-Preview

  • Requires 12GB of VRAM (eg. Intel Arc A770 / B580).
  • Won't fit on 8GB A750

Performance on an A770 with OpenArc

=== Streaming Performance ===
Total generation time: 65.078 seconds
Prompt evaluation: 1376 tokens in 0.841 seconds (1636.58 T/s)
Response generation: 982 tokens in (15.09 T/s)
Downloads last month
13
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Gapeleon/DeepCoder-14B-Preview-int4-awq-ov