Section 01
BadT2I Research Guide: Backdoor Attacks Against Text-to-Image Diffusion Models
Core Points
- Paper Background: Published as an oral paper at ACM MM 2023; the open-source implementation is available at https://github.com/zhaisf/BadT2I
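- Attack Method: Implants backdoors in T2I diffusion models through multimodal data poisoning, i.e. poisoning both the text and image sides of the training data
- Attack Types: Three backdoor targets at different semantic levels: pixel-level, object-level, and style-level
- Trigger: An invisible character inserted into the text prompt, such as the zero-width space (\u200b)
- Base Model: The experiments are built on Stable Diffusion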
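
To make the trigger point concrete, here is a minimal Python sketch of how a zero-width space changes a prompt without changing how it looks on screen. The helper names `add_trigger` and `contains_trigger` are hypothetical, and prepending the trigger to the prompt is an assumption made for illustration; see the BadT2I repository for how the trigger is actually placed.

```python
# Zero-width space (U+200B): renders as nothing in most fonts and terminals.
TRIGGER = "\u200b"


def add_trigger(prompt: str, trigger: str = TRIGGER) -> str:
    """Return the prompt with the trigger prepended.

    Prepending is an assumption for this sketch; the position is a
    design choice of the attack, not something fixed by the character.
    """
    return trigger + prompt


def contains_trigger(prompt: str, trigger: str = TRIGGER) -> bool:
    """Check whether the (invisible) trigger occurs anywhere in the prompt."""
    return trigger in prompt


clean_prompt = "a photo of a dog"
triggered_prompt = add_trigger(clean_prompt)

# The two prompts usually look identical when printed,
# but they differ by exactly one code point.
print(repr(clean_prompt))
print(repr(triggered_prompt))
print(len(clean_prompt), len(triggered_prompt))
print(contains_trigger(clean_prompt), contains_trigger(triggered_prompt))
```

Because the two strings differ as sequences of code points, a tokenizer can in principle produce different inputs for them, which is what lets a backdoored model react to a trigger the user cannot see. Printing with `repr` is a simple way to reveal such characters when inspecting prompts.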
This study shows that T2I diffusion models face a serious backdoor threat, and it aims to raise the community's awareness of model security.