Section 01
PluralValueBench: A Benchmark Tool for Evaluating LLMs' Understanding of Cultural Value Pluralism
PluralValueBench is a benchmark tool and dataset designed to evaluate whether large language models (LLMs) understand and respect value differences across different cultural backgrounds. Built on Schwartz's value theory and covering 8 major global cultural regions, it uses quantitative metrics (e.g., KL divergence) to compare model outputs with real human survey data, helping identify cultural biases in models and supporting AI ethics and cross-cultural deployment.