Section 01
[Introduction] Mosaic: An Innovative Inference Framework for 30x Context Length Expansion of Diffusion LLMs
The Mosaic project addresses the context length bottleneck of Diffusion large language models (Diffusion LLMs). Through two core technologies—global memory planning and dynamic peak taming—it achieves over 30x expansion of context length, bringing revolutionary breakthroughs to scenarios such as long document processing and code generation. This solution significantly reduces memory usage, improves inference efficiency, and promotes the transition of Diffusion LLMs from research prototypes to practical applications.