Section 01
[Introduction] FakeVLM: An Interpretable Multimodal Paradigm for Synthetic Image Detection (Accepted by NeurIPS 2025)
This article introduces the FakeVLM project accepted by NeurIPS 2025. Addressing two core challenges in synthetic image detection—detection models becoming obsolete due to rapid evolution of generation technologies, and lack of interpretability in black-box models—it proposes a new framework integrating interpretable multimodal vision-language models (VLM) and fine-grained artifact analysis. This framework not only determines the authenticity of images but also explains the reasoning in natural language, bringing breakthroughs to AI-generated image detection.