Reading

CVE-LMTune: A Vulnerability Classification Framework for Multi-Taxonomy Systems Based on Hierarchical Fine-Tuned Language Models

This article introduces the CVE-LMTune framework, which enables automated vulnerability annotation for three major security taxonomies—MITRE ATT&CK, CWE, and CAPEC—using hierarchical cascading strategy and shared embedding technology. It achieves weighted F1 scores of 90%-93% on the SecureBERT model.

漏洞分类MITRE ATT&CKCWECAPECSecureBERT多标签分类层次级联网络安全语言模型微调

Published 2026-03-29 02:17Recent activity 2026-03-29 02:18Estimated read 7 min

CVE-LMTune: A Vulnerability Classification Framework for Multi-Taxonomy Systems Based on Hierarchical Fine-Tuned Language Models

Section 01

[Introduction] Core Introduction to the CVE-LMTune Framework

This article introduces CVE-LMTune—a vulnerability classification framework for multi-taxonomy systems based on hierarchical fine-tuned language models—aimed at automating the annotation of vulnerability descriptions into three authoritative security taxonomies: MITRE ATT&CK, CWE, and CAPEC. Using a hierarchical cascading strategy and shared embedding technology, the framework achieves weighted F1 scores of 90% for CWE, 92% for CAPEC, and 93% for MITRE ATT&CK on the SecureBERT model, effectively addressing the issues of class imbalance and large label space in multi-label classification.

Section 02

Background and Challenges

With the evolution of cybersecurity threats, the number of newly disclosed vulnerabilities is growing rapidly. However, vulnerability descriptions are mostly unstructured text, making them difficult to directly use in security operations. The industry relies on taxonomies like MITRE ATT&CK (Attack Tactics and Techniques), CWE (Common Weakness Enumeration), and CAPEC (Common Attack Pattern Enumeration and Classification) to improve management efficiency, but manual mapping has problems of high complexity and long time consumption. Additionally, vulnerabilities often involve multiple labels, with a large label space and class imbalance, and traditional machine learning and general large language models have limited performance in handling such extreme multi-label tasks.

Section 03

Three-Stage Design of the CVE-LMTune Framework

The CVE-LMTune framework consists of three stages: 1. Data Pipeline: Automatically integrate vulnerability information from multiple sources to build an annotated dataset covering multiple taxonomies; 2. Standardized Fine-Tuning and Evaluation Protocol: Specifically address the issue of extreme multi-label imbalance; 3. Hierarchical Cascading Architecture: Decompose the large classification space into smaller subproblems, gradually refine labels following the hierarchical structure of the taxonomy, and reduce learning difficulty.

Section 04

Model Selection and Experimental Results

Experimental comparisons show that fine-tuned encoder models (e.g., BERT series) are significantly better than generative models. On SecureBERT (a cybersecurity-optimized BERT variant), the hierarchical cascading strategy shows obvious improvements over flat classification: weighted F1 reaches 90% for CWE (12% improvement), 92% for CAPEC (8% improvement), and 93% for MITRE ATT&CK (12% improvement). This indicates that using the hierarchical structure of taxonomies can effectively improve the performance of fine-grained categories.

Section 05

Core Innovations: Hierarchical Cascading and Shared Embeddings

The core innovations of CVE-LMTune include: 1. Hierarchical Cascading Architecture: Decompose decisions according to the natural structure of the taxonomy (e.g., for CWE, first determine the major category then refine subcategories), using divide-and-conquer to reduce the complexity of subtasks; 2. Shared Embedding Mechanism: Classifiers at different levels share the underlying text representation, requiring only the addition of lightweight classification heads, making the computational overhead of hierarchical reasoning close to that of flat models and improving deployment feasibility.

Section 06

Practical Application Value

The application value of CVE-LMTune is reflected in: 1. Security Vendors/Vulnerability Databases: Reduce manual annotation costs and shorten the time window from vulnerability disclosure to classification; 2. Enterprise Security Teams: Achieve accurate priority ranking and correlation analysis through standardized labels, quickly identifying high-risk vulnerabilities; 3. Robustness: Shows good generalization ability on zero-day vulnerabilities and emerging threat patterns, and can handle vulnerability types not seen during training.

Section 07

Open Source Ecosystem and Future Outlook

CVE-LMTune has been open-sourced, providing reproducible baselines and tools for the community to promote the standardization of vulnerability classification. Future directions include: combining the semantic understanding of generative models with the classification stability of encoder models; exploring cross-language vulnerability classification and multi-modal analysis (code snippets, PoC videos, etc.); designing dedicated architectures and strategies for security domain needs to improve accuracy and interpretability.

Continue Reading

Keep going with more reads from the same topic.

Nornir MCP Server: An Enterprise-Grade Bridge for Integrating Large Language Models into Network Automation

Nornir MCP Server is an enterprise-level server based on the Model Context Protocol (MCP). It seamlessly integrates large language models (such as Claude) with the Nornir network automation framework, supporting natural language orchestration for multi-vendor network devices (Cisco, Arista, Juniper, etc.), and providing production-grade features like a dual-engine architecture (NAPALM + Netmiko), intelligent filtering, and a secure sandbox.

Recent activity 2026-05-06 20:51

Bibliothèque Française LLM: A French Public Domain Literature Index System Optimized for Large Language Models

Bibliothèque Française LLM is a structured indexing and annotation project for French public domain literature designed specifically for large language models (LLMs). It integrates multiple authoritative sources such as DraCor, Common Corpus, and Wikisource, providing metadata indexing categorized by genre, author, and era, as well as in-depth annotations for dramatic texts (including characters, lines, stage directions, etc.). Its aim is to enable LLMs to efficiently read and understand classic French literary works.

Recent activity 2026-05-06 20:50

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Splinter is a minimalist, high-performance key-value (KV) and vector storage system enabling zero-latency inter-process communication via shared memory and atomic operations. With only 766 lines of core code, it supports millions of operations per second and 768-dimensional vector storage, offering a new architectural approach for local LLM inference and data-intensive applications.

Recent activity 2026-04-03 08:49

Folkering OS: When the Operating System Itself Is AI—A Self-Evolving Bare-Metal Rust System

Folkering OS is the world's first AI-native bare-metal operating system, entirely written in Rust no_std without relying on Linux, POSIX, or libc. It can generate commands from scratch, compile them into WASM, and run them in 10 seconds, achieving true self-evolution.

Recent activity 2026-04-09 16:15