Section 01
R2R: Efficient Reasoning Path Exploration via Collaborative Routing Between Small and Large Models (Introduction)
R2R, a NeurIPS 2025 paper, proposes a token-level routing mechanism in which a small model and a large model collaborate during generation. It targets the high inference cost of large models and reports substantially lower computational cost (e.g., a 40-60% cost reduction on math tasks) while maintaining reasoning quality.
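To make the idea of token routing concrete, here is a minimal toy sketch of a generation loop in which a small model proposes each token and a router decides whether to ask the large model instead. It is not the paper's implementation: this section does not say how R2R decides which tokens to route, so `should_use_large` is a placeholder rule, and `route_generate`, `small_next`, and `large_next` are hypothetical names with scripted stand-ins for real models.

```python
from typing import Callable, List, Tuple

Token = str
NextToken = Callable[[List[Token]], Token]     # context -> next token
Router = Callable[[List[Token], Token], bool]  # (context, small candidate) -> use large?


def route_generate(
    prompt: List[Token],
    small_next: NextToken,
    large_next: NextToken,
    should_use_large: Router,
    max_new_tokens: int,
    eos: Token = "<eos>",
) -> Tuple[List[Token], int]:
    """Generate tokens, calling the large model only when the router asks for it."""
    tokens = list(prompt)
    large_calls = 0
    for _ in range(max_new_tokens):
        candidate = small_next(tokens)           # cheap proposal
        if should_use_large(tokens, candidate):  # hypothetical routing decision
            candidate = large_next(tokens)       # expensive replacement
            large_calls += 1
        tokens.append(candidate)
        if candidate == eos:
            break
    return tokens[len(prompt):], large_calls


# Scripted stand-ins for real models (assumption, for illustration only):
# the two "models" agree everywhere except on the answer token.
SMALL_SCRIPT = ["2", "+", "2", "=", "5", "<eos>"]
LARGE_SCRIPT = ["2", "+", "2", "=", "4", "<eos>"]
PROMPT = ["Q:"]


def small_next(context: List[Token]) -> Token:
    return SMALL_SCRIPT[len(context) - len(PROMPT)]


def large_next(context: List[Token]) -> Token:
    return LARGE_SCRIPT[len(context) - len(PROMPT)]


def should_use_large(context: List[Token], candidate: Token) -> bool:
    # Placeholder rule: treat the token right after "=" as the critical one.
    return context[-1] == "="


generated, large_calls = route_generate(
    PROMPT, small_next, large_next, should_use_large, max_new_tokens=10
)
print(" ".join(generated), "| large-model calls:", large_calls)
```

In this toy run the large model is called for one of the six generated tokens, which illustrates where the savings could come from: most tokens are produced by the small model, and the large model is used only where the router judges it necessary.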