Section 01
[Introduction] Research on Hallucination Evaluation of Multilingual Large Language Models from an Indian Language Perspective
This study conducts a systematic evaluation of the hallucination behaviors of three open-source large language models—Phi-4, Qwen, and LLaMA-2—across five major Indian languages (Hindi, Bengali, Telugu, Tamil, Malayalam). By integrating semantic evaluation and mechanistic interpretability techniques, it fills the gap in existing research on hallucination evaluation for low-resource languages and provides key insights for building more fair and reliable multilingual AI systems.