Section 01
[Introduction] Core Overview of Research on Conflict-Aware Reasoning in Clinical Vision-Language Models
This study focuses on conflict detection mechanisms in medical vision-language models (VLMs), proposing Defer Gate to identify discrepancies between image-only predictions and predictions combining images and laboratory data, aiming to enhance the reliability of models in clinical decision-making. Addressing the risk of misdiagnosis caused by multi-modal information conflicts, the study explores methods to enable medical VLMs to have conflict-aware capabilities.