Section 01
WBench: Introduction to the Comprehensive Multi-Round Benchmark for Evaluating Interactive Video World Models
The Meituan team has released WBench, a benchmark for comprehensively evaluating interactive video world models. It covers 289 test cases and 1,058 interaction rounds (about 3.7 rounds per test case on average) and assesses models along five dimensions: video quality, setting adherence, interaction adherence, consistency, and physical compliance. The code and data are open source (GitHub: https://github.com/meituan-longcat/WBench), and the benchmark is intended to give academia and industry a unified evaluation standard.
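To make the multi-round structure concrete, here is a minimal sketch of how per-round scores over the five dimensions might be represented and averaged. It is illustrative only: the dimension keys, the function names, the 0-to-1 score range, and the unweighted mean are all assumptions made for this example, not WBench's actual data format or scoring method, which are defined in the repository.

```python
from statistics import mean

# Hypothetical keys for the five dimensions named above (not WBench's real schema).
DIMENSIONS = (
    "video_quality",
    "setting_adherence",
    "interaction_adherence",
    "consistency",
    "physical_compliance",
)


def average_rounds_per_case(total_rounds: int, total_cases: int) -> float:
    """Average number of interaction rounds per test case."""
    return total_rounds / total_cases


def per_dimension_mean(cases: list[list[dict[str, float]]]) -> dict[str, float]:
    """Average each dimension over every round of every test case.

    Each test case is a list of rounds; each round maps a dimension name
    to a score. An unweighted mean is an assumption made for illustration.
    """
    rounds = [rnd for case in cases for rnd in case]
    return {dim: mean(rnd[dim] for rnd in rounds) for dim in DIMENSIONS}


if __name__ == "__main__":
    # Figures reported for the benchmark: 1058 rounds across 289 test cases.
    print(round(average_rounds_per_case(1058, 289), 2))

    # Two made-up test cases (two rounds and one round) with placeholder scores.
    toy_cases = [
        [{d: 0.5 for d in DIMENSIONS}, {d: 1.0 for d in DIMENSIONS}],
        [{d: 0.0 for d in DIMENSIONS}],
    ]
    print(per_dimension_mean(toy_cases))
```

Averaging over rounds rather than over test cases gives longer interactions more weight; a real evaluation could reasonably choose either, so this choice should be treated as a placeholder.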