Section 01
[Introduction] LemonadeBench: A Benchmark for Evaluating the Economic Intuition of Large Language Models
LemonadeBench is a benchmark for evaluating the economic intuition of large language models (LLMs). It aims to fill a gap in how LLMs' economic reasoning capabilities are assessed. Using the classic lemonade stand scenario, it tests how models reason about core economic concepts such as supply and demand, pricing strategy, and market dynamics, which makes it a useful probe of their practical reasoning skills.