Section 01
Uni-Edit: Unifying the Three Capabilities of Multimodal Models via a Single Intelligent Editing Task
Uni-Edit proposes intelligent image editing as a universal training task. Training on this single task, with a single dataset, can improve all three capabilities of a multimodal model (understanding, generation, and editing) at once, avoiding the trade-offs that multi-task training usually forces between them.