Back to Quant Hub
Llama 3.2 1B Instruct
1BMeta Llama 3.2
Ultra-light Llama for mobile and embedded. Sub-2GB VRAM with Q4.
Consumer GPUMac / Apple SiliconCPU / VPS
131K
Max Context
2
Quant Variants
GGUF Q8_0
Best Quality
99.5%
Accuracy Retained
Meta Llama 3.2
Ultra-light Llama for mobile and embedded. Sub-2GB VRAM with Q4.
131K
Max Context
2
Quant Variants
GGUF Q8_0
Best Quality
99.5%
Accuracy Retained