Back to Quant Hub
Llama 3.2 11B Vision Instruct
11BMeta Llama 3.2
Multimodal Llama with image understanding. Vision encoder adds ~2GB VRAM overhead.
Consumer GPUMac / Apple Silicon
131K
Max Context
2
Quant Variants
GGUF Q8_0
Best Quality
99.5%
Accuracy Retained