Section 01
[Introduction] Sherlock: Open-Source Self-Correcting Reasoning Framework for VLMs (NeurIPS 2025 Accepted)
Sherlock is the first framework to enable intrinsic self-correcting capabilities in vision-language models (VLMs). Its paper has been accepted by NeurIPS 2025 and open-sourced. The framework achieves significant improvements on multiple benchmarks with only 20K samples. Author: DripNowhy. Project repository link: https://github.com/DripNowhy/Sherlock. Paper link: http://arxiv.org/abs/2505.22651. Released on June 4, 2026.