Section 01
[Introduction] Rethinking Generalization in Reasoning SFT: Conditional Analysis of Optimization, Data, and Model Capabilities
This study systematically analyzes the generalization problem of reasoning supervised fine-tuning (SFT) from three dimensions: optimization, data, and model capabilities, revealing the key factors affecting generalization performance and their interaction mechanisms. The study points out that generalization is a complex phenomenon involving multi-factor interactions, and traditional single-factor analysis is insufficient. It provides a conditional analysis framework and practical guidance for improving the generalization ability of reasoning models.