Section 01
FusionLLM Project Introduction
FusionLLM is an open-source large language model with a hybrid architecture that combines multi-head latent attention (MLA), a gated delta network (Mamba-2), and mixture of experts (MoE). It targets two bottlenecks of the standard Transformer architecture: inefficient long-sequence processing and high inference cost. Its stated goals are efficient pre-training, efficient inference, and production readiness. The project is maintained by atandra2000, is open-sourced on GitHub, and was released on June 9, 2026.