Section 01
【Introduction】EvoArena and EvoMem: A New Paradigm for Memory Evolution of LLM Agents in Dynamic Environments
To address the challenges of deploying in real-world dynamic environments, researchers have introduced the EvoArena benchmark suite and the EvoMem patch-based memory paradigm. Experiments show that current agents have an average accuracy of only 39.6% in dynamic environments, while EvoMem not only improves performance in dynamic environments but also enhances results on standard benchmarks, emphasizing the importance of modeling "evolution" in evaluation and memory.