Section 01
[Introduction] New Framework for Web Agent Observation Compression: MFS Enables Evaluation Speedup and Performance Optimization
Web agents built on large language models are constrained by excessively long HTML observations. Recent research proposes Minimal Failure Sets (MFS) as a proxy metric for the effectiveness of HTML compression, speeding up evaluation by more than 100x. Pruning programs optimized against MFS reduce latency by 2-3x while maintaining 84-89% task success rates on WorkArena and WebLinx.