Section 01
[Introduction] AEGIS Benchmark: A New Evaluation Framework for Forensic Analysis of AI-Generated Academic Images
The rapid development of generative AI has triggered an integrity crisis for academic images. Researchers have introduced the AEGIS Benchmark, which systematically evaluates the academic image forensics capabilities of 25 multimodal large language models (MLLMs) and 9 expert models. The benchmark rests on three key innovations: domain-specific complexity, diverse forgery simulation, and multi-dimensional forensic assessment. Its results indicate that current forensic technologies lag significantly behind generative AI.