Section 01
New Framework for Credibility Assessment of Dental AI: Cross-Dataset Calibration of Multimodal Large Language Models and Confidence-Based Triage Mechanism
This article introduces the mats_dental_triage project, a credibility assessment and calibration framework for AI diagnostic systems in oral diseases. The project addresses the reliability issues of multimodal large language models (MLLMs) in dental image triage through modality-aware temperature scaling, confidence-weighted integration, and selective referral mechanisms, and has validated its effectiveness on five real-world datasets. The project was developed by teams including Chen Peng, Shi Chuyan, and Wei Bo, with the source available on GitHub, and the related findings are intended to be submitted to npj Digital Medicine.