Section 01
Omni123: Using 2D Data to Compensate for 3D Scarcity with a 3D-Native Model Unifying Text-to-2D and Text-to-3D Generation
Omni123 is a 3D-native foundation model that unifies text-to-2D and text-to-3D generation in order to address the scarcity of 3D data. It represents text, images, and 3D as discrete tokens in a shared sequence space, and it uses 2D data as a geometric prior to improve its 3D representations.
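To make the idea of a shared sequence space concrete, here is a minimal sketch of one common way to place several modalities in a single token vocabulary: give each modality a disjoint id range. This is an illustration under stated assumptions, not Omni123's actual tokenizer. The codebook sizes, the modality name `shape`, and the helpers `to_shared`, `modality_of`, and `build_sequence` are all hypothetical.

```python
# Hypothetical codebook sizes, chosen only for illustration
# (they are NOT taken from Omni123).
VOCAB_SIZES = {"text": 1000, "image": 512, "shape": 256}

# Give each modality a disjoint id range inside one shared vocabulary.
OFFSETS = {}
_next_free = 0
for _name, _size in VOCAB_SIZES.items():
    OFFSETS[_name] = _next_free
    _next_free += _size
SHARED_VOCAB_SIZE = _next_free


def to_shared(modality, local_ids):
    """Map modality-local token ids into the shared id space."""
    size = VOCAB_SIZES[modality]
    offset = OFFSETS[modality]
    for i in local_ids:
        if not 0 <= i < size:
            raise ValueError(f"{modality} id {i} is out of range")
    return [offset + i for i in local_ids]


def modality_of(shared_id):
    """Recover which modality a shared id belongs to."""
    for name, offset in OFFSETS.items():
        if offset <= shared_id < offset + VOCAB_SIZES[name]:
            return name
    raise ValueError(f"id {shared_id} is outside the shared vocabulary")


def build_sequence(text_ids, image_ids=(), shape_ids=()):
    """Concatenate text, then image, then 3D tokens into one sequence."""
    return (
        to_shared("text", text_ids)
        + to_shared("image", image_ids)
        + to_shared("shape", shape_ids)
    )


# A text-image pair and a text-3D pair end up in the same id space.
text_to_2d = build_sequence([3, 7], image_ids=[0, 5])
text_to_3d = build_sequence([3, 7], shape_ids=[1])
```

Because both kinds of training pair become sequences over the same vocabulary, a single sequence model could in principle be trained on text-image data and text-3D data together, which is the sense in which 2D data can compensate for scarce 3D data.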