Section 01
[Introduction] SecuriFine: A Safety Alignment Evaluation Toolkit for Fine-tuning Cybersecurity LLMs
SecuriFine is an AI safety evaluation toolkit specifically designed for cybersecurity scenarios. It aims to help developers maintain safety alignment when fine-tuning large language models (LLMs) and prevent potential security risks and misuse. It fills the gap where traditional fine-tuning evaluations ignore the safety dimension, providing a complete framework to assess and maintain the safety alignment of fine-tuned LLMs in cybersecurity scenarios.