Section 01
MMSU: Introduction to the New Benchmark for Evaluating Social Intelligence of Multimodal Large Language Models
MMSU (Multimodal Social Understanding) is an evaluation benchmark for the social intelligence capabilities of multimodal large language models, filling the gap in the current AI evaluation system for measuring social cognitive abilities. It provides a systematic framework to assess models' understanding and reasoning abilities in complex social scenarios, covering multiple dimensions such as emotion recognition and social context reasoning. Preliminary evaluations reveal that mainstream models have significant shortcomings in social intelligence, which is of great value for AI research, development, and industry applications.