Section 01
ADB Framework: Measurement and Insights into LLM Safety Alignment Drift Under Quantization Compression
This article introduces the Alignment Drift Benchmark (ADB) framework, which is the first to quantify the impact of model compression techniques on the safety alignment capabilities of large language models (LLMs). The core viewpoint is: while model compression improves efficiency, it may compromise safety alignment. The ADB framework reveals this drift phenomenon through a dual-track evaluation system, providing a quantitative basis for deployment decisions in production environments, and emphasizing that efficiency optimization should not come at the cost of safety.