Section 01
导读 / 主楼:Fireworks AI Training Practical Manual: A Full-Stack Fine-Tuning Guide from SFT to RL
Introduction / Main Floor: Fireworks AI Training Practical Manual: A Full-Stack Fine-Tuning Guide from SFT to RL
Detailed explanation of Fireworks' open-source cookbook project, covering reinforcement learning algorithms such as GRPO and DAPO, preference optimization methods like DPO/ORPO, and the complete training process of supervised fine-tuning (SFT), providing developers with a production-grade generative AI model training solution.