Section 01
[Introduction] PixDLM: A Dual-Path Multimodal Reasoning Segmentation Model for UAV Scenarios
PixDLM, a CVPR 2026 Highlight work proposed by the Xiamen University team, addresses challenges in UAV scenarios such as small objects, large field of view, and high scene complexity through decoupling semantic reasoning and pixel perception dual paths, achieving leading performance on the DRSeg benchmark. The work also releases the first UAV reasoning segmentation dataset DRSeg, and has open-sourced model weights, code, and the dataset, providing a new solution for UAV visual understanding.