Section 01
Zero Model: Introduction to the Small Open-Source Model Family Focused on Security Reasoning
Zero is an open-source family of small language models specifically trained to reason about security issues directly, just like senior security researchers. Addressing the pain point of large language models giving ambiguous responses when handling security problems, it adheres to the core philosophy of "no avoidance, no whitewashing" and strives to provide direct and accurate answers in the security domain. The project explores the minimal model size required for true security reasoning and the transferability of capabilities. Training data comes from CTF competition challenges, and it uses GRPO (Generalized Reward Policy Optimization) adversarial self-play training.