Section 01
S3 Dataset: Guide to the Significant Breakthrough of Multimodal Large Models in Medical Video Understanding
Seizure-Semiology-Suite (S3) is the first multimodal dataset and benchmark for understanding seizure semiology, containing 438 seizure videos and over 35,000 dense annotations covering 20 ILAE-defined semiological features. This study reveals the systemic weaknesses of current multimodal large language models (MLLMs) in medical video understanding and proposes improvement solutions, providing key benchmarks and development directions for the medical AI field.