Back to Quant Hub

DeepSeek-Coder-V2-Lite Instruct

16B

DeepSeek

MoE architecture coding model. Active params ~2.4B, total ~16B. Exceptional code quality.

76.5K HF downloads166 likesbartowski/DeepSeek-Coder-V2-Lite-Instruct-GGUF· stats from 6/24/2026
Consumer GPUMac / Apple Silicon

164K

Max Context

3

Quant Variants

GGUF Q8_0

Best Quality

99.8%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.8511.1 GB3.1%145 tok/s
GGUFQ8_08.517.5 GB0.2%118 tok/s
AWQINT449.8 GB4.1%192 tok/s