Section 01
Aqal: Introduction to the World's First Urdu Reasoning-Optimized Large Language Model
Aqal is the world's first reasoning-optimized large language model specifically designed for Urdu. Through a three-stage training process (continuous pre-training, supervised fine-tuning, and GRPO reinforcement learning), it significantly improves multi-step reasoning, logical consistency, and the correctness of final answers, filling the gap in high-quality reasoning models for low-resource languages like Urdu.