Section 01
[Introduction] Core Summary of Research on the Activation Mechanism of Emotional Concepts in Open-Source Large Language Models
Based on Anthropic's research framework, this article analyzes the emotional concept activation mechanisms of five open-source large language models (Qwen, Mistral, Falcon, Zephyr, and OpenChat) using a paired emotional detection method, revealing significant differences in emotional processing among different models. The study found that all models exhibit emotional polarization (higher activation levels for negative/high-arousal emotions), obvious differences in activation intensity between models, and the top three emotions are concentrated in negative/high-arousal types such as fear, love, and anger. These findings have practical guiding significance for model selection, bias mitigation, and prompt engineering optimization.