Section 01
[Introduction] SecPI: Let Reasoning Models Internalize Security Thinking, Bid Farewell to Code Security Vulnerabilities
In the era of AI-assisted programming, reasoning language models (RLMs) serve as helpful tools for developers, but the code they generate often contains security vulnerabilities. The research team proposes the SecPI method, which allows models to internalize structured security reasoning as default behavior through fine-tuning, enabling them to generate secure code without security prompts during inference. Experiments show that the QwQ 32B model's secure code generation rate increased by 14 percentage points, and it has generalization capabilities across vulnerability types and programming languages.