Section 01
[Introduction] Predicting Future Behavior: A New Paradigm for Controlled Generation of Large Reasoning Models
Large reasoning models (such as DeepSeek-R1 and OpenAI o1) possess strong multi-step reasoning capabilities, but they face unpredictability issues that hinder practical deployment. This study proposes training activation probes to predict the future behavior of models and develops the Future Probe Controlled Generation (FPCG) method based on this, enabling effective guidance with almost no reduction in output quality, thus opening up a new direction for research on the controllability of reasoning models.