Section 01
NCERTQABench: A Large-Scale Bilingual QA Dataset for Indian Education AI
Large language models are increasingly widely used globally, but how do they perform in non-English environments and on domain-specific knowledge? The education sector is an important test scenario—it not only tests the model's knowledge reserve but also its depth of understanding of curriculum content. The NCERTQABench project was built to address this need: a large-scale bilingual QA dataset rooted in India's school curriculum system, providing valuable resources for evaluating and optimizing educational AI.