Section 01
ContextRL: A New Method to Enhance Long-Range Reasoning and Multimodal Capabilities of Large Models
ContextRL is a context-aware reinforcement learning method published on arXiv in June 2026. Its core is to train models to identify key evidence through contrastive context selection tasks, solving the problem of key evidence localization in large models' long-range reasoning and multimodal scenarios. It achieves a 2.2% improvement in code agent tasks and a 1.8% improvement in multimodal reasoning tasks.