Zing Forum


Backdoor Defense for Multimodal Large Models: A Unified Framework Based on Patch Enhancement and Cross-View Regularization

This paper proposes a backdoor defense framework for multimodal large language models. Through patch-level data augmentation and cross-view output difference regularization, it effectively suppresses the success rate of backdoor attacks while maintaining the model's normal text generation capability.

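Tags: backdoor defense, multimodal large models, data augmentation, cross-view regularization, AI security, model trustworthiness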
Published 2026-04-06 15:27 | Recent activity 2026-04-07 11:51 | Estimated read 1 min

Section 01

Introduction / Main Post: Backdoor Defense for Multimodal Large Models: A Unified Framework Based on Patch Enhancement and Cross-View Regularization

The paper proposes a backdoor defense framework for multimodal large language models. It combines two components: patch-level data augmentation of the inputs, and a regularization term on the difference between the model's outputs across views of an input. Together, these are reported to suppress the success rate of backdoor attacks while preserving the model's normal text generation capability.
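The post does not give the paper's actual formulation, so the following is only a rough sketch of what the two ingredients could look like. Everything in it is an assumption made for illustration: the function names (`patch_shuffle`, `cross_view_divergence`), the choice of tile shuffling as the patch-level augmentation, and the choice of symmetric KL divergence as the cross-view output difference.

```python
import math
import random


def patch_shuffle(image, patch, seed=0):
    """Split a 2D image (list of rows) into patch x patch tiles and permute the tiles.

    Hypothetical stand-in for a patch-level augmentation; the paper may use another.
    """
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image sides must be divisible by the patch size")
    rows, cols = height // patch, width // patch
    # Top-left corner of every tile, in row-major order.
    corners = [(r * patch, c * patch) for r in range(rows) for c in range(cols)]
    shuffled = corners[:]
    random.Random(seed).shuffle(shuffled)
    out = [[None] * width for _ in range(height)]
    # Copy the tile found at `src` into the slot at `dst`.
    for (dr, dc), (sr, sc) in zip(corners, shuffled):
        for i in range(patch):
            for j in range(patch):
                out[dr + i][dc + j] = image[sr + i][sc + j]
    return out


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_view_divergence(logits_a, logits_b, eps=1e-12):
    """Symmetric KL divergence between the output distributions of two views.

    Assumed measure of "cross-view output difference"; not taken from the paper.
    """
    p, q = softmax(logits_a), softmax(logits_b)
    kl_pq = sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))
    kl_qp = sum(qi * math.log((qi + eps) / (pi + eps)) for pi, qi in zip(p, q))
    return 0.5 * (kl_pq + kl_qp)
```

In a training loop, one would presumably run the model on both the original image and its augmented view, then combine `cross_view_divergence` with the usual language modeling loss. The weight and even the sign of that term are not stated in the post, so they are left open here.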