Section 01
[Introduction] OCRBench Series Benchmarks: Key Tools for Comprehensive Evaluation of OCR Capabilities in Large Language Models
Optical Character Recognition (OCR) has been transformed by the rise of Large Multimodal Models (LMMs). Traditional evaluations, however, focus only on character- and word-level accuracy and do not cover the broader capabilities of LMMs, such as semantic understanding and information extraction. The OCRBench series of benchmarks (the original OCRBench, OCRBench v2, and MDPBench) was developed to fill this gap, giving the research community a systematic assessment tool and helping drive progress in the OCR field.