Section 01
Introduction: The Perception-Judge Framework Addresses Perceptual Judgment Bias in Multimodal LLM Judges
The KAIST research team proposes the Perception-Judge framework, which effectively mitigates the perceptual judgment bias of multimodal large models when acting as judges by constructing the Perceptual Perturbation Dataset (PPJD) and using GRPO reinforcement learning + batch ranking reward training. This framework improves the perceptual fidelity, ranking consistency, and human alignment of judgments, and has open-sourced the dataset, models, and code resources.