InternLM2 20B Chat

20B

Shanghai AI Lab

Mid-size InternLM2 model with excellent Chinese comprehension. Fits on a 24 GB GPU at Q4 (13.8 GB of VRAM at Q4_K_M).

⬇ 1.7K HF downloads · ♥ 8 likes · bartowski/internlm2_5-20b-chat-GGUF · stats from 6/24/2026

Consumer GPU · Pro GPU

Max Context: 33K

Quant Variants

Best Quality: GGUF Q5_K_M

Accuracy Retained: 98.6%

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

Format	Level	BPW	VRAM	PPL Loss	Speed
GGUF	Q4_K_M	4.85	13.8 GB	2.9%	78 tok/s
GGUF	Q5_K_M	5.68	15.8 GB	1.4%	68 tok/s
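
The BPW and VRAM columns can be related with a rough back-of-the-envelope calculation. The sketch below is illustrative only: it assumes a nominal 20 billion parameters and decimal gigabytes, neither of which the page states, and the helper names `weight_gb` and `retained_pct` are hypothetical. It estimates the size of the weights alone from BPW, then treats the gap to the listed VRAM as overhead (presumably KV cache and runtime buffers, though the page does not say how VRAM is measured). It also shows that the 98.6% "Accuracy Retained" figure appears to equal 100% minus the Q5_K_M PPL loss.

```python
# Rough sketch: relate bits-per-weight (BPW) to the VRAM figures in the table.
# ASSUMPTION: nominal parameter count of 20e9; the page only says "20B".
PARAMS = 20e9

# (level, BPW, listed VRAM in GB, PPL loss in %) copied from the table.
ROWS = [
    ("Q4_K_M", 4.85, 13.8, 2.9),
    ("Q5_K_M", 5.68, 15.8, 1.4),
]


def weight_gb(params: float, bpw: float) -> float:
    """Estimated size of the quantized weights alone, in decimal GB."""
    return params * bpw / 8 / 1e9


def retained_pct(ppl_loss_pct: float) -> float:
    """ASSUMPTION: 'Accuracy Retained' is simply 100% minus the PPL loss."""
    return 100.0 - ppl_loss_pct


for level, bpw, vram_gb, ppl_loss in ROWS:
    weights = weight_gb(PARAMS, bpw)
    overhead = vram_gb - weights  # listed VRAM not explained by weights alone
    print(
        f"{level}: weights ~{weights:.1f} GB, "
        f"overhead ~{overhead:.1f} GB, "
        f"retained ~{retained_pct(ppl_loss):.1f}%"
    )
```

Under these assumptions the weights account for most of the listed VRAM at both levels, which leaves several gigabytes of headroom on a 24 GB card for longer contexts.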