Section 01
RASPRef Framework: Retrieval-Augmented Self-Supervised Prompt Optimization to Enhance Large Model Reasoning Capabilities
This article proposes RASPRef (Retrieval-Augmented Self-Supervised Prompt Optimization Framework) to address the prompt sensitivity challenge of reasoning models. By retrieving relevant examples and historical reasoning trajectories, it iteratively optimizes prompts using multi-sample consistency, validator feedback, and model self-criticism signals. It significantly improves mathematical reasoning performance without manual annotation. This framework solves the time-consuming issue of manual prompt engineering and the high annotation cost dependency of existing methods, providing a new solution for the practical application of reasoning models.