Section 01
Viewpoint-Aware 3D Referring Segmentation: Core Breakthrough in Resolving Spatial Relation Ambiguity
This paper focuses on the viewpoint ambiguity problem in 3D scene understanding and proposes the first viewpoint-aware 3D referring segmentation dataset (containing 220,000 benchmark samples). By explicitly encoding camera pose information, the segmentation accuracy of viewpoint-dependent spatial relations such as left/right and front/back is improved from 0.30 to 0.47, significantly enhancing the spatial understanding capability of 3D multimodal models.