Section 01
[Introduction] Visual Traps of Multimodal Large Models: ACL 2026 Study Reveals Misleading Chart Attacks and Defenses
The ACL 2026 main conference paper study found that multimodal large language models (MLLMs) see their accuracy plummet to random levels when faced with misleading charts, with a maximum drop of 65.5 percentage points. The research team proposed six inference-time correction methods, with the best solution improving accuracy by 19.6 percentage points.