Section 01
【Introduction】CoRD: Core Breakthroughs in Multi-Teacher Collaborative Distillation for Long Chain-of-Thought Reasoning
CoRD (Collaborative Reasoning Distillation) is an innovative framework for distilling Long Chain-of-Thought (Long-CoT) reasoning. Through multi-teacher collaborative stepwise decoding, combined with perplexity scoring and beam search, it addresses issues in existing distillation methods such as blindness, lack of dynamic exploration, and missed complementary reasoning. It reduces redundant sampling while maintaining reasoning quality, enabling student models to achieve performance close to that of teacher models.