Back to Quant Hub

Mistral 7B Instruct v0.3

7B

Mistral AI

Classic Mistral 7B v0.3. Still a reliable baseline for local chat APIs.

32.9K HF downloads41 likesbartowski/Mistral-7B-Instruct-v0.3-GGUF· stats from 6/24/2026
Consumer GPUMac / Apple SiliconCPU / VPS

33K

Max Context

3

Quant Variants

GGUF Q6_K

Best Quality

99.1%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.855.2 GB3.2%158 tok/s
GGUFQ6_K6.566.8 GB0.9%135 tok/s
AWQINT444.6 GB4.5%225 tok/s