Section 01
Introduction: MR-ALIGN—Enhancing Factual Accuracy of Large Reasoning Models via Meta-Reasoning
MR-ALIGN is a meta-reasoning-guided alignment framework that improves the factual accuracy of large reasoning models. It tracks state transition probabilities along reasoning trajectories, which improves factual question answering performance without relying on external validators. The work has been accepted to ACL 2026 Findings.
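To make "state transition probabilities in reasoning trajectories" concrete, here is a minimal sketch of how such probabilities could be estimated by counting. It is an illustration only, not MR-ALIGN's actual procedure: the function name `transition_probabilities`, the state labels (`plan`, `retrieve`, `verify`, `answer`), and the first-order counting scheme are all assumptions made for this example.

```python
from collections import Counter, defaultdict


def transition_probabilities(trajectories):
    """Estimate first-order transition probabilities between state labels.

    `trajectories` is a list of label sequences, one per reasoning trace.
    Returns a nested dict: probs[current_state][next_state] -> probability.
    """
    counts = defaultdict(Counter)
    for states in trajectories:
        # Count each adjacent (current, next) pair in the trace.
        for current, nxt in zip(states, states[1:]):
            counts[current][nxt] += 1

    probs = {}
    for current, next_counts in counts.items():
        total = sum(next_counts.values())
        probs[current] = {nxt: n / total for nxt, n in next_counts.items()}
    return probs


# Hypothetical labeled reasoning traces (labels are illustrative assumptions).
trajectories = [
    ["plan", "retrieve", "verify", "answer"],
    ["plan", "retrieve", "answer"],
    ["plan", "verify", "answer"],
]

probs = transition_probabilities(trajectories)
for state, row in probs.items():
    print(state, row)
```

Each row of the result sums to 1 over the next states observed after that state. How MR-ALIGN defines its states and uses these probabilities for alignment is not specified in this section.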