Section 01
Introduction: RLER—A New Paradigm for Video Reasoning Combining Reinforcement Learning and Evidence Election
This paper proposes the RLER dual-paradigm framework, which uses reinforcement learning to train models to generate structured evidence, then selects reliable answers via a training-free evidence weighted election mechanism. It achieves SOTA on 8 video reasoning benchmarks, with an average improvement of 6.3% and only requiring 3.1 candidates.