Section 01
LLM Quantization Gallery: A Visual Encyclopedia of 93 Large Model Quantization Methods
In the era of large language models (LLMs) with billions to trillions of parameters, quantization is a core model compression technique to reduce computational and storage pressure while preserving performance. However, the proliferation of algorithms (from GPTQ/AWQ to QuaRot/AQLM) makes systematic understanding challenging. The LLM Quantization Gallery—an open-source knowledge base maintained by Arpit Singh Gautam—addresses this by visually organizing 93 quantization methods across 8 categories, each with flowcharts, technical cards, and cross-references, serving as a 'visual encyclopedia' for model compression learning.