Section 01
[Introduction] Research on Diagnosis and Mitigation of Modality Interference in MLLMs
This article conducts research on the modality interference problem in Multimodal Large Language Models (MLLMs), proposing a perturbation-based causal diagnosis method and a consistency regularization fine-tuning framework, which effectively improves the model's unimodal robustness and cross-modal capabilities.