Section 01
Introduction to the SIEVES Framework: A New Selective Prediction Method Based on Visual Evidence Scoring
Key Points of SIEVES
This paper proposes the SIEVES framework, which requires reasoning models to generate localized visual evidence and evaluate its quality. It increases coverage by up to 3x across 5 out-of-distribution (OOD) benchmarks and can be transferred to proprietary models like o3 and Gemini-3-Pro, providing a new solution for the reliable deployment of multimodal models.