Section 01
Introduction / Main Post: Activation Vector Steering: Precisely Controlling Large Language Model Behavior via Representation Engineering
Activation steering controls a model's behavior by adding steering vectors to the internal activations of a large language model at inference time, without changing its weights. This makes it a useful tool for research on model interpretability and controllability. This article introduces two implementation paths and their applications.
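To make the core operation concrete, here is a minimal sketch in plain Python. It is not either of the two implementation paths covered in the article, only an illustration of the update h' = h + alpha * v applied to one layer's output. The names `run`, `add_scaled`, `steer_at`, and `alpha`, and the toy layers `double` and `relu` standing in for transformer blocks, are all assumptions made for this example.

```python
from typing import Callable, List, Optional

Vector = List[float]
Layer = Callable[[Vector], Vector]


def add_scaled(h: Vector, v: Vector, alpha: float) -> Vector:
    """Return h + alpha * v, the basic steering update."""
    if len(h) != len(v):
        raise ValueError("steering vector must match the activation width")
    return [hi + alpha * vi for hi, vi in zip(h, v)]


def run(layers: List[Layer], x: Vector,
        steer_at: Optional[int] = None,
        vector: Optional[Vector] = None,
        alpha: float = 1.0) -> Vector:
    """Run the toy stack; optionally steer the output of layer `steer_at`."""
    h = x
    for i, layer in enumerate(layers):
        h = layer(h)
        # Steering happens only at inference time, on the activation
        # itself; the layers (the "weights") are never modified.
        if steer_at == i and vector is not None:
            h = add_scaled(h, vector, alpha)
    return h


# Hypothetical toy layers; a real model would have transformer blocks here.
def double(h: Vector) -> Vector:
    return [2.0 * x for x in h]


def relu(h: Vector) -> Vector:
    return [max(0.0, x) for x in h]


layers = [double, relu]
x = [1.0, -1.0]

baseline = run(layers, x)  # [2.0, 0.0]
steered = run(layers, x, steer_at=0, vector=[0.0, 1.0], alpha=3.0)  # [2.0, 1.0]
print(baseline, steered)
```

In this toy case the steering vector pushes the second hidden unit from -2.0 to 1.0 before the ReLU, so the unit survives in the output. With a real model the same addition would typically be applied inside a forward hook on a chosen layer, and `alpha` would be tuned to trade off the strength of the effect against output quality.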