Section 01
VEBench: Guide to the First Large Model Evaluation Benchmark for Video Editing Scenarios
VEBench is the first benchmark to systematically evaluate how well Large Multimodal Models (LMMs) understand video editing and reason about editing operations. It contains 3.9K high-quality edited videos (over 257 hours in total) and 3,080 human-validated question-answer pairs. Experiments reveal a significant gap between current models and human-level editing cognition, which points to directions for developing intelligent video editing systems.