Section 01
[Introduction] Disagreement-Guided Strategy Routing: Making Large Model Reasoning Smarter and More Efficient
Large reasoning models exhibit unstable performance on mathematical tasks. Existing test-time expansion strategies have problems such as high computational overhead and a one-size-fits-all approach for all instances. This study proposes a disagreement-guided strategy routing framework that dynamically selects processing strategies based on output disagreement: lightweight processing for low-disagreement instances, majority voting for moderate disagreement, and problem rewriting for high ambiguity. The framework achieves a 3-7% accuracy improvement while reducing sampling costs, and can be integrated into existing reasoning pipelines without additional training.