Section 01
【Introduction】Retrieval Enhancement + Reliability Awareness: A New Framework to Mitigate Multimodal Visual Hallucinations
This paper proposes a retrieval-enhanced reliability-aware reasoning framework aimed at addressing the visual hallucination problem in multimodal systems. The core idea is to construct an external visual evidence database, combine multiple reliability metrics to evaluate prediction credibility, and dynamically adjust output strategies through a decision gating mechanism. This method can effectively improve prediction accuracy and reduce hallucination rates without retraining large multimodal models, providing more reliable solutions for key scenarios such as medical imaging and autonomous driving.