Section 01
Introduction: Key Points of Cost-Aware Optimization for Agent Query Execution
This article proposes a new paradigm called "Agent Query Execution" and its corresponding optimizer EnumGRPO. By interleaving planning and execution of Large Language Model (LLM)-based agents, it achieves joint optimization of query cost and answer quality. In the SWAN benchmark test, this method achieves a 317x cost reduction and an 18% accuracy improvement.