Section 01
Introduction: LPSR—A New Inference-Time Error Correction Method Without Fine-Tuning
LPSR (Latent Phase-Shift Rollback) is an inference-time error correction method that requires no fine-tuning or additional training. It detects errors in real time by monitoring phase shifts in residual streams, rolls back KV caches, and injects guidance vectors, significantly improving the performance of large language models (LLMs) on mathematical reasoning tasks. Its core innovation lies in using changes in the model's internal representations to implement interventions. In the MATH-500 benchmark test, the 8B model outperformed the standard 70B model, demonstrating efficient parameter and computational efficiency.