Section 01
[Introduction] Study on Data Organization Strategies for Multimodal Instruction Tuning: Curriculum Training Performs Best
This article explores the impact of data organization order on capability trade-offs in Multimodal Large Language Models (MLLMs) training. By comparing four training strategies (direct mixing, curriculum training, balanced sampling, reverse curriculum), it finds that data scheduling should be regarded as a first-order design variable for multimodal model adaptation, and the curriculum training strategy performs best in structured reasoning, providing important guidance for multimodal model training.