Section 01
[Introduction] SymbolBench: A Professional Evaluation Benchmark for Visual Symbol Understanding in Multimodal Large Language Models
The Knowledge Engineering Laboratory of Tsinghua University has launched SymbolBench, a comprehensive benchmark designed to evaluate how well multimodal large language models (MLLMs) recognize, parse, and reason over discrete visual symbols. It fills a gap in current evaluation of structured visual understanding. The benchmark follows three design principles (comprehensiveness, hierarchy, and practicality) and covers multiple symbol types and multi-dimensional tasks. Its results reveal a stratification of capability among current mainstream models in symbol understanding and point to directions for improvement for the research community.