Section 01
PluralBench-NP: First Nepali Multicultural Values Classification Benchmark Dataset
PluralBench-NP is the first benchmark dataset focused on Nepali cultural multicultural values classification. It aims to evaluate large language models (LLMs) on their ability to understand values in the Nepali cultural context. The dataset uses a 'multi-LLM voting + human-AI dual validation' label generation strategy, balancing efficiency and cultural sensitivity. It is significant for low-resource language NLP research, AI ethics, and cultural alignment.