Section 01
[Introduction] mllm-jailbreak-bench: A Key Tool for Safety Evaluation of Multimodal Large Language Models
mllm-jailbreak-bench is an open-source security evaluation benchmark tool for Multimodal Large Language Models (MLLMs). It provides a systematic and reproducible adversarial attack testing framework covering five main attack categories, helping researchers and developers detect model security vulnerabilities. It fills the gap in safety evaluation for multimodal models and promotes the shift of AI safety testing from unsystematic to standardized processes.