Section 01
Reasoning models' internal trajectories are truly different; the difference is most significant in the code domain
Recent research finds that when reasoning-trained language models solve difficult problems, the geometric characteristics of their internal hidden state trajectories have systematic differences from ordinary instruction-tuned models, and this difference is most pronounced in the code domain. This post will detail the background, methods, findings, and significance of this study.