Section 01
ATLAS Framework: A New Paradigm Unifying Agentic and Implicit Visual Reasoning with Functional Tokens
ATLAS is a visual reasoning framework proposed by researchers at institutions including the Chinese University of Hong Kong and Shanghai Artificial Intelligence Laboratory. Its core innovation is the functional token, which unifies agentic reasoning and implicit visual reasoning within a single discrete-token representation. This design eliminates the external execution latency of agentic reasoning while retaining its interpretability. The framework also introduces the LA-GRPO algorithm to address the sparsity problem in functional token training, so that it achieves both strong performance and interpretability rather than trading one for the other.