Section 01
[Introduction] Research on Reasoning Primitives of Hybrid Architecture LLMs: A Decoupled Analysis of Retrieval and State Tracking
Recent research decomposes the reasoning capabilities of LLMs into two fundamental primitives: retrieval (retrieving information from trained knowledge) and state tracking (maintaining and updating intermediate states). The study finds that hybrid architectures (combining attention-based retrieval and cyclic state updates) significantly outperform pure attention models in state tracking tasks without sacrificing retrieval ability. This discovery provides new ideas for selecting appropriate architectures for different application scenarios, promoting the understanding of LLM reasoning capabilities from a black-box to a white-box approach.