Section 01
【Introduction】Core Summary of Research on Functional Metacognitive States in LLMs
The study finds that large language models (LLMs) contain an internal, decomposable space of functional metacognitive states. Using residual stream analysis and activation steering, it shows that these states can be linearly decoded and that they causally regulate reasoning behavior. The finding matters for the reliability of AI evaluation, for model alignment, and for understanding the underlying neural mechanisms.
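To make "linearly decoded" and "activation steering" concrete, the following is a minimal sketch on synthetic data, not the study's actual method or code. It assumes a binary state planted along one direction of a random stand-in for residual stream activations; the function names (`fit_direction`, `decode`, `steer`), the difference-of-means probe, and all numeric settings are illustrative assumptions.

```python
import numpy as np


def fit_direction(acts, labels):
    """Unit-norm difference-of-means direction between the two label groups."""
    diff = acts[labels == 1].mean(axis=0) - acts[labels == 0].mean(axis=0)
    return diff / np.linalg.norm(diff)


def decode(acts, direction, threshold):
    """Linear decoding: project onto the direction and apply a threshold."""
    return (acts @ direction > threshold).astype(int)


def steer(acts, direction, alpha):
    """Activation steering: shift every activation by alpha along the direction."""
    return acts + alpha * direction


# Synthetic stand-in for residual stream activations (assumption: no real model).
rng = np.random.default_rng(0)
d, n = 64, 400
true_dir = rng.normal(size=d)
true_dir /= np.linalg.norm(true_dir)
labels = rng.integers(0, 2, size=n)
acts = rng.normal(size=(n, d)) + 4.0 * labels[:, None] * true_dir

# Fit the probe and place the threshold midway between the two class means.
direction = fit_direction(acts, labels)
proj = acts @ direction
threshold = 0.5 * (proj[labels == 1].mean() + proj[labels == 0].mean())

# Decoding accuracy on the synthetic data.
accuracy = (decode(acts, direction, threshold) == labels).mean()

# Steer the state-0 activations and measure how many now decode as state 1.
steered = steer(acts[labels == 0], direction, alpha=8.0)
flipped = decode(steered, direction, threshold).mean()

print(f"decoding accuracy: {accuracy:.3f}")
print(f"fraction decoded as state 1 after steering: {flipped:.3f}")
```

In a real model the activations would be read from, and written back into, a chosen layer during a forward pass, and the causal claim would rest on a change in downstream reasoning behavior rather than on the probe's own readout, as it does in this toy setup.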