Section 01
EdgeLLM-Systems: Introduction to the Research Framework for Large Model Inference Systems on Edge Devices
EdgeLLM-Systems is a GitHub project maintained by TianyiLan (source: https://github.com/TianyiLan/EdgeLLM-Systems, last updated 2026-06-13T13:47:25Z) that focuses on research into large-model inference systems in resource-constrained edge environments. The project provides a toolchain for performance profiling, memory-footprint analysis, and inference-efficiency evaluation, and it supports deployment optimization for models such as LLaMA on edge devices. Its core content covers the classification of target edge platforms, a three-dimensional measurement framework, experimental results, the technical toolchain, and future directions, offering a data-driven reference for edge AI deployment.