Back to Quant Hub

Aya 23 8B

8B

Cohere For AI

Multilingual specialist covering 23 languages. Strong for non-English local apps.

3.5K HF downloads55 likesbartowski/aya-23-8B-GGUF· stats from 6/24/2026
Consumer GPUMac / Apple SiliconCPU / VPS

8K

Max Context

2

Quant Variants

GGUF Q4_K_M

Best Quality

96.8%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.855.7 GB3.2%145 tok/s
AWQINT445.0 GB4.6%205 tok/s