Section 01
[Introduction] S-Bench: A Benchmark for Evaluating Social Intelligence of Multimodal Large Language Models
S-Bench is the first comprehensive benchmark suite dedicated to evaluating the social intelligence of multimodal large language models. To address the limitations of existing evaluations, it covers dimensions such as theory of mind, emotion recognition, and social norms, and it pairs multimodal inputs with multi-dimensional evaluation metrics. It offers a standardized tool for model development, product selection, and academic research. Through its open-source community, the project also aims to advance future directions such as cross-cultural expansion and dynamic, interactive evaluation.