Section 01
Introduction: Counterintuitive Discrete Probability Problem Dataset—A New Benchmark for AI Reasoning Evaluation
The research team has released a dataset of counterintuitive problems in discrete probability, including classic paradoxes, recreational math problems, and original designed questions, along with detailed solutions. This dataset aims to test whether large language models (LLMs) will make systematic cognitive bias errors similar to those made by humans, providing a new benchmark for evaluating AI reasoning capabilities. The dataset combines historical depth and innovative breadth; it is not only used for AI evaluation but also provides value for understanding AI cognitive characteristics and probability education.