Section 01
OAKS Benchmark: Evaluating LLMs' Online Adaptation to Continuous Knowledge Streams
The KAIST AI team released the OAKS benchmark (accepted to ACL 2026 Main), which is the first framework specifically designed to evaluate large language models (LLMs) online adaptation capabilities in dynamic, continuously updated knowledge streams. This benchmark simulates continuously incoming knowledge streams, testing whether models can track knowledge evolution in real time and adjust their responses. It includes both synthetic and real-world datasets, and open-source resources (datasets, evaluation code, etc.) to drive industry progress.