Section 01
DualGeo: A Dual-Perspective Framework to Improve Global Image Geolocation Accuracy
This paper proposes the DualGeo two-stage framework, which fuses image and semantic segmentation features via bidirectional cross-attention, combined with geographic clustering reordering and LMM reasoning. It improves street-level (<1km) and city-level (<25km) geolocation accuracy by 3.6%-16.58% and 1.29%-8.77% respectively on the IM2GPS, IM2GPS3k, and YFCC4k benchmarks, providing a new approach for global image geolocation.