Section 01
Introduction to the Adversarial Coevolution Framework: Innovative Exploration of LLM-Assisted PPO Agent Training
This article introduces an open-source project combining reinforcement learning (RL) and large language models (LLMs). The core is an adversarial coevolution framework that trains PPO agents against LLM opponents in the Gin Rummy card game, achieving a 99.12% win rate. The project demonstrates the potential of knowledge distillation and curriculum learning in complex incomplete information environments, providing a new paradigm for RL training. Developed by the Nikelroid team, the project is open-sourced on GitHub (link: https://github.com/Nikelroid/adversarial-coevolution), created in September 2025 and updated in May 2026.