Section 01
RouterGym: Guide to the Agent Benchmark Framework for SLM Replacement of LLM
RouterGym is a benchmark framework for evaluating the feasibility of small language models (SLMs) replacing large language models (LLMs) in Agent tasks. The project implements a routing-memory co-design, supports multiple routing strategies, memory systems, and contract validation, and provides empirical evidence for SLM-led Agent architectures through comprehensive cost, quality, and latency trade-off analysis.