Section 01
[Introduction] Core Findings of a Chain-of-Thought Faithfulness Study: Reasoning Models Are More Reliable Than Instruction Models
An empirical study of chain-of-thought faithfulness finds a key difference between instruction models and reasoning models in how they explain their own reasoning: the explanations given by reasoning models more faithfully reflect their internal decision-making. This article covers the background, the core findings, the experimental methods, the reasons for the difference, and the implications for applications. The research code and data have been open-sourced and serve as a reference for work on model interpretability.