章节 01
AgentVista: A Multi-Modal Agent Visual Task Evaluation Platform (导读)
AgentVista is a specialized platform for evaluating multi-modal agents' performance in complex, real-world visual tasks. It focuses on testing their capabilities in multi-step workflows, dynamic environments, tool use, and long-term visual reasoning, helping researchers and developers understand their actual performance in challenging image scenarios. This post will break down its background, features, usage, and value.