Section 01
Urban-WORM: Introduction to the Multimodal Model-Driven Intelligent Annotation Tool for Crowdsourced Geospatial Data
Urban-WORM is an open-source multimodal inference workflow framework that focuses on generating rich and interpretable automatic annotations for geotagged crowdsourced image data, suitable for urban research, geographic information systems (GIS), and spatial data analysis scenarios. It aims to address the high cost and scalability challenges of traditional manual annotation, providing a user-friendly interface that allows users to build image understanding pipelines without deep knowledge of model details.