Section 01
[Introduction] CodeTalkers: Revealing the Hidden Costs of Instruction Tuning for Code LLMs
This study proposes the concept of "Instruction Tuning Tax", pointing out that while instruction tuning improves the instruction-following ability of code large language models (such as Qwen2.5-Coder, DeepSeek-Coder), it may impair their core programming capabilities like code completion and infilling. The study quantifies this hidden cost through comparative experiments and explores mitigation strategies, which has important guiding significance for model selection and application.