Section 01
导读 / 主楼:From Principles to Production: A Systematic Study Note on LLM Inference Technology
Introduction / Main Floor: From Principles to Production: A Systematic Study Note on LLM Inference Technology
This is a systematic study note on LLM inference technology compiled by an engineer during his paternity leave, covering a complete knowledge system from Transformer principles and inference bottleneck analysis to production deployment.