Section 01
Complete Open-Source Large Model Post-Training Framework: Introduction to the open-posttraining-system Project
Large language model training consists of two phases: pre-training and post-training. Post-training is the key link that determines whether a model can meet actual application requirements. The open-source project open-posttraining-system provides a complete post-training engineering framework covering supervised fine-tuning (SFT), preference optimization, reinforcement learning (including RLHF), reasoning ability cultivation, evaluation system, and scalable inference system, filling the gap in the open-source community's lack of systematic post-training implementation.