Section 01
[Introduction] Economic Trade-offs of Large-Model Distillation Strategies: A Comparative Study of Reasoning-Trace and Answer-Only Distillation
This study systematically compares the cost-efficiency and performance of reasoning-trace distillation and answer-only distillation for Transformer language models, with the aim of providing a quantitative basis for decisions about model compression and edge deployment. The two strategies differ significantly in training cost, inference performance, and final task quality. Through systematic evaluation, the study constructs a decision framework that helps practitioners weigh these trade-offs and choose between the two strategies.