Section 01
Introduction: In-Depth Analysis of MEBench, a New Benchmark for Cross-Document Multi-Entity QA
MEBench is a cross-document multi-entity QA benchmark framework accepted by the EMNLP 2025 main conference, specifically designed to evaluate large language models' cross-document multi-entity QA capabilities. It addresses the reasoning challenges posed by scattered information in real-world scenarios, covering core content such as dataset construction, evaluation metrics, and experimental results, helping to understand the capabilities and limitations of large models in complex information integration tasks.