Section 01
[Introduction] Perceptual Diversity in Multilingual VLMs and Exploration of a Multimodal Redescription Framework
The AACL-IJCNLP 2025 research project addresses perceptual diversity in multilingual vision-language models (VLMs): speakers of different languages describe the same visual content in systematically different ways, which poses challenges for model fairness and accuracy. The project proposes a multimodal redescription framework that aims to improve the cross-lingual capabilities of VLMs by introducing an intermediate redescription step, balancing technical improvements with cultural respect.