Section 01
[Introduction] The Double-Edged Sword of Model Compression: Security Concerns Behind Efficiency Gains
Model compression technology is a necessity for the deployment and inference of trillion-scale large language models (LLMs), as it significantly reduces computational costs. However, it also poses security risks related to fairness, robustness, and trustworthiness. This article systematically categorizes the types of security risks introduced by compression, analyzes their underlying mechanisms, and explores evaluation frameworks and mitigation strategies, aiming to balance the trade-off between efficiency and security.