Section 01
Introduction to CMC Firewall: A Conformal Prediction-Based Defense Against Visual Prompt Injection in Multimodal LLMs
As Multimodal Large Language Models (MLLMs) see wider deployment, visual prompt injection has become a serious security challenge. CMC (Conformal Cross-Modal Firewall) is a pre-model defense that screens inputs before they reach the model, using three stages: OCR text extraction, SigLIP risk scoring, and inductive conformal prediction calibration. Together these stages let it control the false positive rate while preserving model utility, addressing the dilemma in which traditional defenses are either oversensitive (too many false positives) or too lenient (missed attacks).
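To make the calibration idea concrete, here is a minimal sketch of how an inductive (split) conformal threshold can bound the false positive rate. It is an illustration under stated assumptions, not CMC's actual implementation: the function names `conformal_threshold` and `is_flagged`, the target rate `alpha`, and the convention that a higher risk score means a more suspicious input are all hypothetical, and the risk scores are plain numbers standing in for whatever the OCR and SigLIP stages would produce.

```python
import math


def conformal_threshold(benign_scores, alpha):
    """Return a risk-score threshold calibrated on benign inputs only.

    Assumption: scores are exchangeable with future benign scores and
    higher means riskier. Under that assumption, a new benign input
    exceeds the threshold with probability at most alpha.
    """
    n = len(benign_scores)
    if n == 0:
        raise ValueError("need at least one calibration score")
    if not 0 < alpha < 1:
        raise ValueError("alpha must be strictly between 0 and 1")
    # Rank of the conformal quantile: ceil((n + 1) * (1 - alpha)).
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: never flag anything.
        return math.inf
    return sorted(benign_scores)[k - 1]


def is_flagged(score, threshold):
    """Flag an input whose risk score is strictly above the threshold."""
    return score > threshold


# Hypothetical calibration scores from held-out benign images.
calibration = [0.3, 0.1, 0.9, 0.5, 0.7, 0.2, 0.4, 0.8, 0.6]
threshold = conformal_threshold(calibration, alpha=0.5)
```

The design point is that the threshold comes from held-out benign data rather than being hand-tuned, so the tolerated false positive rate `alpha` is chosen explicitly. When the calibration set is too small for the requested `alpha`, this sketch returns an infinite threshold and flags nothing, which keeps the false positive bound intact at the cost of detection.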