Section 01
Introduction: Chakma Machine Translation Project Provides a Technical Example for Endangered Language Preservation
Over 40% of the world's languages are at risk of extinction. Chakma, an endangered language of the eastern Bengal region, has few digital resources and is therefore not supported by mainstream machine translation systems. The Chakma Project, a data science master's project at University College London (UCL), built the first Chakma-English word-level translation dataset and used QLoRA to fine-tune the LLaMA 3.1 8B and Gemma 3 4B models. The result is the first machine translation capability for Chakma and a reference technical path for the digital preservation of endangered languages.