Section 01
SAMA Dataset: A New Benchmark for Evaluating Spatial Reasoning Capabilities of Vision-Language Models on Non-Standard Guide Maps
The SAMA dataset, released by the University of California, Riverside, is the first large-scale visual question-answering benchmark targeting non-standard attraction guide maps. It includes 49 real-scene guide maps (covering 6 categories such as theme parks, zoos, and resorts) and 4296 manually verified question-answer pairs, aiming to fill the gap in evaluating the spatial reasoning capabilities of existing Vision-Language Models (VLMs) on non-standard maps.