Section 01
[Introduction] TurnBack Benchmark: Evaluating Geospatial Cognitive Ability of Large Language Models via Reverse Path Tasks
TurnBack is a benchmark that assesses the geospatial reasoning and navigation abilities of large language models through reverse path tasks, revealing both the strengths and the limitations of current models in spatial understanding. The benchmark has been accepted at EMNLP 2025. Its core innovation is the "reverse path" paradigm, which tests how deeply a model understands spatial relationships. This article covers the background, methodology, experimental findings, error analysis, and future directions.