Section 01
Introduction: The UniReasoner Framework Bridges the Understanding-Generation Gap in Visual Generation
This paper formally defines the understanding-generation gap and introduces the UniReasoner framework to close it. UniReasoner uses an LLM in three steps: it generates a visual draft, critiques that draft itself, and outputs correction signals that guide a diffusion model. The paper reports that this significantly improves compositional alignment and semantic fidelity while maintaining image quality.
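The draft, critique, and correct flow described above can be pictured with a toy sketch. This is not the paper's implementation: the function names (`make_draft`, `critique`, `guided_generate`, `run_pipeline`) are hypothetical, and the "draft" is reduced to a list of prompt elements so the control flow can run without any model. It only illustrates how a critique of a draft might become a correction signal for the generator.

```python
from typing import List, Tuple


def make_draft(prompt_elements: List[str]) -> List[str]:
    # Toy "visual draft": drops the last element to simulate a compositional miss.
    # Assumption: a real system would have an LLM produce the draft.
    return list(prompt_elements[:-1])


def critique(prompt_elements: List[str], draft: List[str]) -> List[str]:
    # Toy self-critique: list what the prompt asks for but the draft lacks.
    # The returned list plays the role of the correction signals.
    return [e for e in prompt_elements if e not in draft]


def guided_generate(draft: List[str], corrections: List[str]) -> List[str]:
    # Toy stand-in for a diffusion model conditioned on the corrections.
    return draft + corrections


def run_pipeline(prompt_elements: List[str]) -> Tuple[List[str], List[str], List[str]]:
    draft = make_draft(prompt_elements)
    corrections = critique(prompt_elements, draft)
    image = guided_generate(draft, corrections)
    return draft, corrections, image


if __name__ == "__main__":
    prompt = ["red cube", "blue sphere", "cube left of sphere"]
    draft, corrections, image = run_pipeline(prompt)
    print("draft:      ", draft)
    print("corrections:", corrections)
    print("final:      ", image)
```

In this sketch the draft misses the spatial relation, the critique surfaces it, and the guided step restores it. How a real system represents drafts and correction signals is left open here.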