Section 01
CoCoReviewBench: Introduction to the New AI Reviewer Evaluation Benchmark
This article introduces CoCoReviewBench, a new evaluation benchmark for AI review systems. By focusing on completeness and correctness rather than simple text overlap with human reviews, it addresses the core issues in current AI review assessment and builds a reliable evaluation system based on 3900 papers from ICLR and NeurIPS. This benchmark provides a new evaluation paradigm for the development of AI review technology.