Section 01
[Overview] Multi-Agent Systems Breakthrough in Screen Learning Behavior Analysis: A Comparative Study of Single-Agent vs. Multi-Agent VLMs
This article focuses on research using Vision-Language Models (VLMs) for automated analysis of screen learning behavior, comparing the performance of single-agent and multi-agent architectures in scene detection and action recognition tasks, proposing two innovative multi-agent frameworks and verifying their superiority, providing an efficient and scalable multimodal data analysis solution for the field of educational technology.