Open Source · Edge Deployment · Geek-First

Quantize Everything.

Bridge the gap between research papers and real-world deployment. Run state-of-the-art LLMs on consumer hardware.

GGUF
AWQ
EXL2
GPTQ
HQQ
& more
30 Models Indexed
5 Formats Tracked
33 GPUs in Database
98.3% Avg Accuracy Retained

Format Heat Index

Community adoption · this week

  • 1. GGUF: 89% (+3%)
  • 2. AWQ: 45% (+7%)
  • 3. EXL2: 32% (0%)
  • 4. GPTQ: 28% (-2%)
  • 5. HQQ: 18% (+12%)

vs last week

Editorial estimate based on Hugging Face GGUF download share and r/LocalLLaMA discussion volume — not a live feed

Data Changelog

Last updated 2026-06-24

  • 2026-06-24: Expanded model index to 30+ entries; added format wizard, hardware profile, ExLlamaV2 CLI, SEO (sitemap/OG), data transparency
  • 2026-06-24: Model detail pages, GPU reverse lookup, shareable VRAM calculator URLs, real homepage stats
  • 2025-06-10: Initial launch: Quant Hub, VRAM calculator, CLI generator, benchmarks, cookbook

All data is manually curated and verified against community sources. Always cross-check with official Hugging Face model cards before deployment.