Section 01
Introduction: Core Overview of the Shibboleth-Bench Benchmark
This article introduces Shibboleth-Bench—a visual anomaly detection benchmark project designed for large multimodal models, aiming to evaluate models' true visual understanding capabilities rather than superficial imitation. By constructing visual samples with subtle anomalies, this benchmark distinguishes whether models truly understand the physical, logical, and semantic rules of scenes, which is of great value for the research, development, and application of multimodal models.