Section 01
Apollo: Efficient Training of Multimodal Models via Spatiotemporal Resource Reuse (Introduction)
Apollo is an innovative multimodal model training system. To address the low GPU resource utilization issue in multimodal model training, it proposes spatiotemporal resource reuse technology, allowing multiple MM modules to run simultaneously on the same GPU and enabling parallel computing through fine-grained resource quota control. While maintaining training quality, it can achieve up to 1.31x training acceleration, effectively optimizing memory and computing resource utilization.