Section 01
[Introduction] Reasoning Trace Collapse: A Hidden Crisis in Fine-tuning Explicit Reasoning Models
This paper reveals the Reasoning Trace Collapse phenomenon in explicit reasoning models (e.g., DeepSeek-R1, OpenAI o1) during downstream fine-tuning—models can still maintain correct answers but lose structured intermediate reasoning processes. This phenomenon is highly covert and undermines the model's interpretability and reliability. The study proposes a structural evaluation framework to detect the problem and uses a loss masking strategy to mitigate the collapse, providing key guidance for the fine-tuning and application of explicit reasoning models.