Section 01
Introduction: QEVA—A New Paradigm for Reference-Free Video Summarization Evaluation
Traditional video summarization evaluation relies on manual reference answers, which has problems such as high cost and insufficient semantic capture. QEVA proposes a new reference-free evaluation paradigm, assessing summary quality across three dimensions (coverage, factuality, and temporal order) via multimodal question answering, and releases the MLVU(VS)-Eval benchmark dataset. Experimental results are highly consistent with human judgments.