Section 01
【Introduction】Forms of Overthinking: Core Summary of Backtracking Burst Pattern Research
This paper addresses the problem that useful self-correction and ineffective self-doubt are hard to distinguish in long trajectories of reasoning models. By analyzing 6000 AIME reasoning trajectories from Qwen3-8B, it finds that correct trajectories mostly have early isolated mild backtracking, while incorrect trajectories show clustered moderate-to-severe backtracking bursts in the middle and late stages. Based on this, a backtracking-aware early exit strategy is proposed, providing new ideas for optimizing reasoning processes. Research source: arXiv 2026-05-27, link http://arxiv.org/abs/2605.27965v1.