Section 01
[Overview] Core Points of the Multi-Layer Adversarial Prompt Detection System
The Abinesh092 team proposes a multi-layer cascaded adversarial prompt detection system. Using a three-layer architecture of rule filtering, machine learning (TF-IDF + LightGBM), and semantic analysis (Sentence-BERT), it defends large language models against prompt injection and jailbreak attacks, addressing the limitations of single protection methods while balancing detection accuracy and real-time response.