Section 01
[Introduction] HERMES++: A Unified Driving World Model Integrating 3D Scene Understanding and Prediction
Autonomous driving technology faces the core dilemma of separating 3D scene semantic understanding and future geometric prediction; existing world models often lean towards one end. HERMES++ integrates the two into a single framework for the first time through four innovative designs: BEV representation, LLM-enhanced world query, current-future link, and joint geometric optimization, outperforming specialized methods in multiple benchmark tests and providing comprehensive capabilities for autonomous driving systems.