Section 01
[Introduction] ByteDance Lance: A 3B-Parameter Unified Multimodal Model Balancing Efficiency and Multi-Task Capability
ByteDance has released Lance, a lightweight, natively unified multimodal model. With only 3 billion active parameters, it performs strongly across image generation, editing, video generation, and understanding. The model uses a phased multi-task training strategy and was trained from scratch on a budget of 128 A100 GPUs, which suggests that capable multimodal models can be built and deployed with comparatively modest compute.