Section 01
SOTOPIA-TOM Benchmark Framework: Information Management and Theory of Mind Evaluation in Multi-Agent Interactions
SOTOPIA-TOM is a multi-dimensional benchmark framework designed to evaluate the ability of LLM agents to manage information in multi-party interaction scenarios with information asymmetry and privacy sensitivity. This framework reveals the persistent limitations of current models in complex coordination scenarios, and Theory of Mind (ToM) interventions have been proven to significantly improve agents' information management performance.