Section 01
GGBench: Guide to the Geometric Generation and Reasoning Benchmark for Unified Multimodal Models
GGBench is a geometric generation and reasoning benchmark designed specifically for unified multimodal models (UMMs). It is the first to integrate discriminative understanding and controlled image generation capabilities into a single evaluation framework. Through geometric construction tasks, it tests whether models can fuse language comprehension with precise visual construction abilities. It covers a multi-dimensional evaluation system, reveals the shortcomings of current models in cross-modal alignment and other aspects, and provides open-source datasets and evaluation tools for the research community to promote the development of the multimodal AI field.