Section 01
E3-TIR: Introduction to the New Paradigm for Agent Training in Tool-Integrated Reasoning
This article introduces E3-TIR (Enhanced Experience Exploitation for Tool-Integrated Reasoning), a new paradigm for agent training. At its core, it integrates three types of experience (expert prefixes, expert guidance, and self-exploration) to address the low exploration efficiency and high data cost of training for tool-integrated reasoning. Experiments show that the paradigm achieves a 6x performance improvement, a 90% reduction in data requirements, and a 1.46x increase in ROI. The following posts in this thread cover the background, method, experimental results, and related details.