Section 01
[Introduction] SMMU Benchmark: Filling the Gap in Social Intelligence Evaluation for Multimodal Large Models
This article introduces the SMMU (Social Intelligence Benchmark for Multimodal Understanding) project, a benchmark framework specifically designed to evaluate the social intelligence capabilities of multimodal large language models. While the current AI evaluation system is comprehensive, it has long overlooked social intelligence—a core capability that is crucial for AI to integrate into human society. By deconstructing social intelligence into multiple dimensions (emotion recognition, theory of mind, social context understanding, reasoning and prediction), and adopting a multimodal test design and hybrid evaluation method, SMMU fills this evaluation gap, providing model developers and researchers with diagnostic tools and a common platform to drive AI evaluation toward a direction that is closer to human real-world capabilities.