Section 01
[Introduction] RIEQE Framework: Enhancing Translation Quality Estimation Capabilities of Large Models via Synergistic Evolution of Implicit and Explicit Reasoning
Core Information
- Research Outcome: Propose the RIEQE two-stage training framework, which achieves the synergistic evolution of implicit and explicit reasoning through NonThinking-SFT and Thinking-RLVR training, and outperforms all baseline models on the WMT test set
- Original Author/Source: arXiv submission (published on May 29, 2026), title Unlocking Fine-Grained Translation Quality Estimation in LRMs through Synergistically Evolving Implicit and Explicit Reasoning, link: http://arxiv.org/abs/2605.31378v1
- Keywords: Translation Quality Estimation, Large Reasoning Models, Implicit Reasoning, Explicit Reasoning, Reinforcement Learning, Machine Translation, Qwen, WMT
This framework aims to address the performance bottleneck of Large Reasoning Models (LRMs) in fine-grained Translation Quality Estimation (QE) tasks, and enhance model capabilities by synergizing the two reasoning modes.