Section 01
[Introduction] Systematic Design Exploration of LUT Hardware Accelerators for 1.58-bit LLM Inference
This article introduces a systematic design framework for lookup table (LUT) hardware accelerators targeting 1.58-bit quantized LLMs. Using an open-source hardware generator and an analytical cost model, it achieves a 2.2x area reduction under the TSMC 16nm process and reveals the critical impact of activation data types on architecture selection.