Back to Quant Hub

OLMo 2 7B Instruct

7B

Allen AI OLMo

Fully open training pipeline from Allen AI. Great for reproducibility research.

⬇ 1.7K HF downloads♥ 12 likesbartowski/OLMo-2-1124-7B-Instruct-GGUF· stats from 6/24/2026

Consumer GPUMac / Apple SiliconCPU / VPS

4K

Max Context

2

Quant Variants

GGUF Q8_0

Best Quality

99.7%

Accuracy Retained

Calculate VRAM Hugging Face Compare

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

Format	Level	BPW	VRAM	PPL Loss	Speed	Actions
GGUF	Q4_K_M	4.85	5.3 GB	3.4%	150 tok/s	Calc HF
GGUF	Q8_0	8.5	8.2 GB	0.3%	125 tok/s	Calc HF