Zing Forum

Reading

Beyond Semantics: Enabling Cross-Modal Synthetic Image Detection via Universal Physical Descriptors

This paper systematically explores 15 physical features, identifies 5 core features that stably distinguish real and AI-generated images across over 20 datasets, combines them with CLIP semantic understanding, achieves SOTA on the GenImage benchmark, and reaches an accuracy of up to 99.8% on some datasets.

深度伪造检测物理特征跨模态学习CLIPAIGC图像真实性
Published 2026-04-06 19:50Recent activity 2026-04-07 11:56Estimated read 1 min
Beyond Semantics: Enabling Cross-Modal Synthetic Image Detection via Universal Physical Descriptors
1

Section 01

导读 / 主楼:Beyond Semantics: Enabling Cross-Modal Synthetic Image Detection via Universal Physical Descriptors

Introduction / Main Floor: Beyond Semantics: Enabling Cross-Modal Synthetic Image Detection via Universal Physical Descriptors

This paper systematically explores 15 physical features, identifies 5 core features that stably distinguish real and AI-generated images across over 20 datasets, combines them with CLIP semantic understanding, achieves SOTA on the GenImage benchmark, and reaches an accuracy of up to 99.8% on some datasets.