Zing Forum

Reading

Large-Scale Procedural Knowledge Retrieval: A New Paradigm for Enhancing Reasoning Capabilities with Reasoning Memory

Reasoning Memory enables reasoning models to reuse historical reasoning experiences by retrieving relevant subroutines from 32 million procedural knowledge entries, achieving significant improvements in math, science, and coding tasks.

Reasoning Memory程序性知识检索增强生成测试时扩展推理模型RAG知识复用
Published 2026-04-02 04:01Recent activity 2026-04-03 10:52Estimated read 5 min
Large-Scale Procedural Knowledge Retrieval: A New Paradigm for Enhancing Reasoning Capabilities with Reasoning Memory
1

Section 01

[Introduction] Large-Scale Procedural Knowledge Retrieval: A New Paradigm for Enhancing Reasoning Capabilities with Reasoning Memory

Reasoning Memory is a retrieval-augmented generation framework for reasoning models. By retrieving relevant subroutines from 32 million procedural knowledge entries, it allows models to reuse historical reasoning experiences and achieve significant performance improvements in math, science, and coding tasks. Its core lies in addressing the limitations of existing test-time expansion methods, which handle problems in isolation and cannot reuse procedural knowledge, thus bringing a paradigm shift to reasoning models.

2

Section 02

Background: Limitations of Existing Test-Time Expansion Methods

Test-time expansion methods (e.g., Chain of Thought, multiple sampling) can improve the accuracy of reasoning models, but they have fundamental limitations: each problem is handled in isolation, making it impossible to systematically reuse experiences from similar historical problems, especially ignoring procedural knowledge (metacognitive skills such as problem decomposition, strategy selection, verification and backtracking), which forces models to build reasoning processes from scratch.

3

Section 03

Methodology: Core Ideas and Technical Implementation of Reasoning Memory

Reasoning Memory focuses on retrieving and reusing procedural knowledge (knowledge of "how to do things", such as problem decomposition, strategy selection, etc.). Its technical implementation includes: 1. Trajectory decomposition: splitting existing reasoning trajectories into 32 million "subproblem-subroutine" pairs to form a knowledge base; 2. Retrieval during reasoning: simulating the reasoning method of human experts reusing strategies through three steps: explicit subproblem formulation → retrieval of relevant subroutines → procedural prior reasoning.

4

Section 04

Evidence: Experimental Results and Analysis of Success Factors

In six benchmark tests including math, science, and coding, Reasoning Memory consistently outperforms comparison methods such as traditional document RAG and full-trajectory RAG: it improves by up to 19.2% compared to methods without retrieval, and by an average of 7.9% compared to the strongest computational matching baseline. Ablation studies show that the key to success lies in: the extensive procedural coverage of source trajectories, and the decomposition and retrieval design of subproblem-subroutine pairs.

5

Section 05

Conclusion: Implications for AI Systems and Practical Application Value

Implications: Shift from "memorizing facts" to "learning to solve problems", expand retrieval augmentation to areas such as strategies/methods/verification rules, inspired by human cognition (experts rely on procedural knowledge). Application value: Improve reasoning efficiency (avoid repeated exploration), enhance interpretability (trace decision-making basis), and support continuous improvement (the knowledge base forms a closed loop as it expands with new problems).

6

Section 06

Future Directions: Limitations and Improvement Paths

Current limitations: High cost of knowledge base construction, retrieval accuracy depends on design, strong domain specificity. Future directions: Dynamic knowledge base update (automatically extract procedural knowledge from new reasoning), cross-domain migration, deep integration of retrieval and reasoning, and building a hierarchical procedural knowledge system.