Section 01
FAST: Fast-Slow Thinking with GRPO Boosts VLM Reasoning (NeurIPS 2025 Spotlight)
FAST is a fast-slow thinking training method that uses the GRPO reinforcement learning framework to improve the reasoning of large vision-language models (VLMs). The work received Spotlight recognition at NeurIPS 2025. It draws on dual-process theory from cognitive science: the model learns to choose dynamically between fast and slow thinking modes, and GRPO training optimizes those reasoning decisions. The goal is to address the limited deep-reasoning ability of current VLMs.
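For readers unfamiliar with GRPO, the sketch below illustrates its central step in general form: several responses are sampled for the same prompt, and each response's reward is normalized against the mean and standard deviation of its own group. This is not FAST's implementation. The function name, the epsilon term, the use of the population standard deviation, and the reward values are all assumptions chosen for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group.

    `rewards` holds one scalar reward per response sampled for a single
    prompt. `eps` (an assumption) guards against division by zero when
    every response in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four responses to one prompt (1.0 = correct answer).
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])

# A group where all responses score the same carries no learning signal.
flat = group_relative_advantages([1.0, 1.0, 1.0])
```

Responses that score above their group's mean get a positive advantage and are reinforced; those below get a negative one. Because the baseline comes from the group itself, no separate value network is needed.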