Section 01
Omni Model: A Breakthrough in Cross-Modal Reasoning via Native Multimodal Training and Context Unfolding Mechanism
Omni is a unified multimodal model natively supporting text, images, videos, 3D geometry, and hidden representations. Its native multimodal training gives rise to the 'Context Unfolding' mechanism, allowing the model to explicitly reason across multiple modal representations before generating predictions, thus bringing new breakthroughs to cross-modal intelligence.