Section 01
Introduction / Main Post: CSAQ Quantization Framework: Protecting Large Model Reasoning Ability with Causal Salience Scoring
CSAQ is a post-training quantization (PTQ) method that identifies critical weights by their causal importance score, computed as gradient × activation. It preserves the model's reasoning ability under 4-bit quantization and addresses a failure mode of methods such as AWQ, in which 80% of critical weights are incorrectly quantized.
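To make the gradient × activation idea concrete, here is a minimal NumPy sketch on a toy linear layer. It is not CSAQ's actual implementation. It assumes the score is taken per input channel as the mean of |activation × gradient of the loss with respect to that activation| over a calibration batch, and that the highest-scoring channels are simply left in full precision while everything else is rounded to 4 bits. The function names, the squared-error loss, and the `keep_frac` parameter are all hypothetical choices made for illustration.

```python
import numpy as np

def salience_scores(x, grad_x):
    """Per-input-channel score: mean |activation * gradient| over samples."""
    return np.mean(np.abs(x * grad_x), axis=0)

def quantize_rtn(w, bits=4):
    """Symmetric per-row round-to-nearest quantization, returned dequantized."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(w), axis=1, keepdims=True) / qmax
    scale = np.where(scale == 0, 1.0, scale)  # guard against all-zero rows
    return np.clip(np.round(w / scale), -qmax - 1, qmax) * scale

def quantize_with_protection(w, scores, keep_frac=0.05, bits=4):
    """Quantize w, leaving the top-scoring input channels untouched.

    Assumption: protected channels stay in full precision. The source does
    not say how CSAQ actually treats the weights it flags as critical.
    """
    n_keep = max(1, int(round(keep_frac * scores.size)))
    protected = np.argsort(scores)[-n_keep:]
    w_q = quantize_rtn(w, bits)
    w_q[:, protected] = w[:, protected]
    return w_q, protected

# Toy calibration data for a linear layer y = x @ w.T (all values synthetic).
rng = np.random.default_rng(0)
x = rng.normal(size=(256, 32))       # activations entering the layer
w = rng.normal(size=(16, 32))        # layer weights, shape (out, in)
target = rng.normal(size=(256, 16))  # stand-in for a downstream loss target

y = x @ w.T
grad_y = (y - target) / len(x)       # dL/dy for a mean squared-error loss
grad_x = grad_y @ w                  # dL/dx via the chain rule

scores = salience_scores(x, grad_x)
w_q, protected = quantize_with_protection(w, scores)
w_rtn = quantize_rtn(w)

print("protected input channels:", sorted(protected.tolist()))
print("plain 4-bit error:    ", np.linalg.norm(w - w_rtn))
print("with protection error:", np.linalg.norm(w - w_q))
```

In a real model the gradients would come from backpropagating a task loss through the full network rather than from a closed-form expression. The contrast with an activation-only criterion is that a channel with large activations but near-zero gradient scores low here, since it has little effect on the loss.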