Section 01
[Introduction] Unified Backdoor Defense Framework for Multimodal Large Language Models: Patch Augmentation + Cross-View Regularization
This paper addresses backdoor attacks on multimodal large language models (MLLMs) and proposes a unified defense framework that combines patch-level data augmentation with regularization of the output difference across views. According to the authors, the framework suppresses the backdoor attack success rate while preserving the model's normal text generation capability, offering a new technical approach to multimodal AI security.
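To make the two ingredients concrete, here is a minimal, illustrative sketch and not the paper's implementation. It assumes that patch-level augmentation can be approximated by shuffling square image tiles, and that the cross-view output difference can be measured as a KL divergence between the model's output distributions for the original and augmented views. The names `patch_shuffle` and `cross_view_gap` are hypothetical, as is the choice of KL. The summary above does not specify how the difference enters the training loss, so the sketch only measures it.

```python
import math
import random


def patch_shuffle(image, patch, rng):
    """Permute the patch x patch tiles of a 2D image (list of rows).

    Assumption: tile shuffling stands in for the patch-level augmentation.
    """
    h, w = len(image), len(image[0])
    assert h % patch == 0 and w % patch == 0, "image must tile evenly"
    # Top-left corner of every tile, in row-major order.
    origins = [(r, c) for r in range(0, h, patch) for c in range(0, w, patch)]
    sources = origins[:]
    rng.shuffle(sources)
    out = [[None] * w for _ in range(h)]
    # Copy each source tile into its new destination slot.
    for (dr, dc), (sr, sc) in zip(origins, sources):
        for i in range(patch):
            for j in range(patch):
                out[dr + i][dc + j] = image[sr + i][sc + j]
    return out


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_view_gap(logits_original, logits_augmented, eps=1e-12):
    """KL(p_original || p_augmented) between the two views' outputs.

    Assumption: KL is one possible difference measure, chosen for illustration.
    """
    p = softmax(logits_original)
    q = softmax(logits_augmented)
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))
```

In a training loop, the logits would come from running the model on the original image and on its `patch_shuffle` counterpart; the resulting gap would then be combined with the usual generation loss under whatever weighting and sign the full method prescribes.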