Section 01
Lumen: Core Guide to the From-Scratch LLM Inference Compiler
Lumen Core Guide
Lumen is a compiler and runtime system designed specifically for large language model (LLM) inference. It enables automatic synthesis of quantization kernels through self-developed DSL, IR, and code generators, while prioritizing inference optimization for Korean LLMs. Its core goal is to address the pain point of manually writing quantization kernels in existing solutions, improving inference efficiency and the iteration speed of new quantization technologies.