Section 01
VitaLLM: An Ultra-Compact Ternary LLM Accelerator and a New Breakthrough in Edge AI
Introduction: VitaLLM is a hardware-software co-designed accelerator for ternary LLM inference on edge devices. Through innovations such as a heterogeneous dual-core computing strategy and a dependency-aware scheduling framework, it achieves a decoding throughput of 70.70 tokens/s with an area of 0.223 mm² and a power consumption of 65.97 mW, providing an efficient solution for edge LLM deployment.
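
To put the three headline figures in relation to one another, the short Python sketch below derives two efficiency metrics from them: energy per decoded token and throughput per unit area. It assumes that the quoted power and throughput were measured at the same operating point, which the introduction does not state explicitly. The function names are illustrative and do not come from the VitaLLM work.

```python
# Figures quoted in the introduction.
THROUGHPUT_TOKENS_PER_S = 70.70  # decoding throughput
AREA_MM2 = 0.223                 # accelerator area
POWER_MW = 65.97                 # power consumption


def energy_per_token_mj(power_mw: float, tokens_per_s: float) -> float:
    """Energy per decoded token in millijoules.

    Since 1 mW = 1 mJ/s, dividing mW by tokens/s yields mJ/token.
    Assumption: both values refer to the same operating point.
    """
    return power_mw / tokens_per_s


def throughput_density(tokens_per_s: float, area_mm2: float) -> float:
    """Decoding throughput per unit area, in tokens/s per mm^2."""
    return tokens_per_s / area_mm2


if __name__ == "__main__":
    energy = energy_per_token_mj(POWER_MW, THROUGHPUT_TOKENS_PER_S)
    density = throughput_density(THROUGHPUT_TOKENS_PER_S, AREA_MM2)
    print(f"Energy per token:   {energy:.3f} mJ")
    print(f"Throughput density: {density:.1f} tokens/s/mm^2")
```

Under that assumption, the quoted numbers work out to roughly 0.93 mJ per decoded token and about 317 tokens/s per mm². These are derived values, not figures reported in the source.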