Section 01
MathNet Benchmark Dataset Released: The World's Largest Multilingual Mathematical Reasoning and Retrieval Evaluation Platform
A research team at MIT has released MathNet, the world's largest multilingual benchmark for mathematical reasoning and retrieval. It contains 30,676 Olympiad-level math problems from 47 countries in 17 languages. It is also the first benchmark to systematically evaluate the mathematical retrieval capabilities of large models, and it finds that retrieval quality significantly affects reasoning performance. The release marks a new stage in the evaluation of mathematical AI.