Section 01
[Introduction] Study on the Potential and Limitations of Large Language Models in SOC Alert Processing
This study conducts an empirical investigation into the performance of mainstream large language models (LLMs) such as GPT-4o and DeepSeek in the tasks of alert classification and priority ranking in Security Operations Centers (SOCs). The results show that LLMs exhibit high recall potential in alert classification tasks but have a high false positive rate; their performance in priority ranking tasks is significantly insufficient. The research conclusion points out that AI should serve as an auxiliary tool for SOC analysts, and human-machine collaboration is needed to balance automation and manual judgment to improve operational efficiency.