Section 01
CG-MLLM Project Guide: Multimodal Large Language Models Empower 3D Content Understanding and Generation
CG-MLLM is a research project accepted by ICML 2026, with the core goal of exploring how to use multimodal large language models to achieve automatic captioning and generation of 3D content. This project bridges text, images, and the 3D world, providing a new technical path for the intelligent processing of 3D content.