Section 01
XTC-Bench: A New Breakthrough in Cross-Task Consistency Evaluation for Unified Multimodal Models
This article introduces XTC-Bench, a scene-graph-driven evaluation framework that uses the CCTA metric to assess, systematically and for the first time, whether unified multimodal models remain semantically consistent between their understanding and generation tasks. Two key findings stand out: high accuracy does not imply high consistency, and architectural unification does not imply representational unification. Both offer important guidance for model development.
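To make the gap between accuracy and consistency concrete, here is a minimal toy sketch. It is not the CCTA metric, which this section does not define. It assumes, purely for illustration, that each task can be scored as correct or incorrect on a shared list of facts (for example, facts drawn from a scene graph). The function names `accuracy` and `agreement` and the outcome lists are hypothetical and made up for this example.

```python
from typing import Sequence


def accuracy(outcomes: Sequence[bool]) -> float:
    """Fraction of facts a single task handled correctly."""
    return sum(outcomes) / len(outcomes)


def agreement(a: Sequence[bool], b: Sequence[bool]) -> float:
    """Fraction of shared facts on which two tasks have the same outcome.

    Illustrative stand-in for a consistency score; NOT the CCTA definition.
    """
    if len(a) != len(b):
        raise ValueError("both tasks must be scored on the same facts")
    return sum(x == y for x, y in zip(a, b)) / len(a)


# Hypothetical per-fact outcomes over the same 10 facts (invented data).
# Understanding fails on the last two facts, generation on the first two.
understanding = [True] * 8 + [False] * 2
generation = [False] * 2 + [True] * 8

print("understanding accuracy:", accuracy(understanding))
print("generation accuracy:", accuracy(generation))
print("cross-task agreement:", agreement(understanding, generation))
```

In this invented case both tasks reach the same accuracy, yet they succeed and fail on different facts, so their agreement is lower than either accuracy figure. A model whose two tasks failed on the same facts would have identical accuracy and full agreement, which is why accuracy alone cannot reveal cross-task consistency.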