Section 01
[Introduction] PruneTIR: Inference-Time Pruning Improves LLM Tool Integration Efficiency and Accuracy
The PruneTIR framework significantly improves the reasoning efficiency and accuracy of tool-augmented LLMs through three inference-time optimization strategies—success-triggered pruning, stuck-triggered pruning and resampling, and retry-triggered tool pausing—without additional training. Addressing the gap of neglecting inference-time optimization in tool-integrated reasoning, it provides a highly cost-effective solution for practical applications.