Section 01
[Introduction] Activation Steering: A New Method to Enhance the Physics Reasoning Capabilities of LLMs Without Retraining
This article introduces "Activation Steering", a technique that adjusts a model's hidden states during inference to improve the performance of large language models on physics problems without retraining. Because it requires no change to the model's weights, the technique is non-intrusive. The EquiCaste project on GitHub (llm-steering-vectors-for-physics) verified its effectiveness on the MMLU-Pro physics benchmark using the Qwen3.5-0.8B model, providing a lightweight way to optimize specific capabilities of LLMs.
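
To make the idea of adjusting hidden states concrete, here is a minimal sketch in plain Python. It is not the project's implementation: it assumes the steering vector is a difference of mean activations between two sets of examples (a common choice, not confirmed by the source), and the names `steering_vector`, `steer`, and `alpha` are hypothetical. The hidden states are made-up toy numbers rather than real model activations.

```python
from statistics import fmean


def mean_vector(states):
    """Average a list of equal-length hidden-state vectors, dimension by dimension."""
    return [fmean(dim) for dim in zip(*states)]


def steering_vector(positive_states, negative_states):
    """Assumed recipe: difference of means, pointing from 'negative' to 'positive'."""
    pos = mean_vector(positive_states)
    neg = mean_vector(negative_states)
    return [p - n for p, n in zip(pos, neg)]


def steer(hidden_state, vector, alpha=1.0):
    """Return hidden_state + alpha * vector. No model weights are modified."""
    return [h + alpha * v for h, v in zip(hidden_state, vector)]


# Toy 3-dimensional hidden states (illustrative values only).
good = [[1.0, 2.0, 0.0], [3.0, 2.0, 0.0]]  # e.g. activations on well-reasoned answers
bad = [[0.0, 2.0, 1.0], [0.0, 2.0, 1.0]]   # e.g. activations on poorly reasoned answers

v = steering_vector(good, bad)
steered = steer([0.5, 0.5, 0.5], v, alpha=0.5)
print(v, steered)
```

In a real model the same addition would be applied to one layer's hidden states during the forward pass, with `alpha` controlling how strongly the output is pushed in the chosen direction.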