Section 01
[Introduction] Core Findings on the "Post-hoc Rationalization" Phenomenon in the Chain of Thought of Reasoning Models
Studies indicate that reasoning models encode decision outcomes before generating the chain of thought; the thinking process is often a rationalization of pre-determined decisions rather than genuine reasoning. Using linear probing and activation intervention techniques, the internal decision-making mechanisms of models have been revealed, challenging traditional perceptions of the chain of thought.