Section 01
[Introduction] CrescendoGuard: An LLM Security Defense Framework Against Multi-Turn Jailbreak Attacks
CrescendoGuard is a reproducible defense framework against Crescendo-style multi-turn dialogue jailbreak attacks, protecting LLMs through a multi-layer mitigation pipeline and cumulative risk scoring mechanism. Built on Llama 3.2 3B Instruct, the framework supports a DryRun simulator (for reproducible benchmarking) and real model clients. It is open-source and reproducible, providing a defense approach of "full dialogue trajectory monitoring" for AI security.