Section 01
Introduction: O'Reilly Course Guides You to Build Reasoning Models from Scratch and Deeply Understand Core Mechanisms
This hands-on course from O'Reilly helps learners deeply understand the working principles of modern reasoning models (such as o1, DeepSeek R1, Gemini 2.0) by building a DeepSeek R1-style reasoning model training process from scratch. It covers core key technologies like Chain of Thought (CoT) and GRPO reinforcement learning. The course emphasizes practicality, allowing learners to fully master the reasoning model building process from theory to code.