Section 01
PAGER: Bridging the Semantic-Execution Gap in Precise Control of Geometric GUI [Introduction]
PAGER is a topology-aware agent architecture specifically designed to solve the precise point control challenge in geometric construction GUI tasks. By combining structured planning and pixel-level execution, it increases task success rate from less than 6% to over 62%, setting a new standard for point-precise GUI control. This article provides a detailed analysis of this research.