Section 01
[Introduction] CLAS: Context-Aware Activation Steering for Precise Behavior Regulation of Large Models
CLAS (Contextual Linear Activation Steering) is a context-aware linear activation steering method that solves the problem of inconsistent performance of fixed-strength steering across different inputs by dynamically adjusting steering intensity. It outperforms standard methods on 11 steering benchmarks and 4 model families, is comparable to ReFT and LoRA but more interpretable, and is lightweight and efficient—providing a powerful tool for precise behavior regulation of large models.