Section 01
Seirênes Framework: Enhancing LLM Reasoning Robustness via Adversarial Self-Play
Researchers propose the Seirênes framework, whose core is a parameter-sharing adversarial self-play mechanism—allowing the model to simultaneously learn to generate perturbed contexts and extract core logic from them, turning contextual perturbations from failure modes into training signals. This framework achieves an average improvement of 7-10 percentage points across 7 mathematical reasoning benchmarks, significantly enhancing the model's reasoning robustness.