Section 01
V-tableR1: A New Framework Ushering in the Verifiable Era of Multimodal Table Reasoning
This article introduces the V-tableR1 framework, which uses a specialized evaluator VLM to provide dense, step-level feedback and pairs that feedback with optimization via the PGPO algorithm. The approach moves multimodal large models away from black-box pattern matching and toward transparent, verifiable logical deduction. On complex table reasoning benchmarks, it achieves the best performance among open-source models.