Section 01
[Introduction] R³ Loop: Enabling Self-Reflection and Correction in AI Image Generation
The CUHK team proposes the Reason-Reflect-Rectify (R³) framework to move text-to-image (T2I) models beyond the single-pass generation paradigm. The team constructs the R³-Bench evaluation benchmark, which reveals a capability gap in current models: they "can identify problems but cannot correct them". It also presents R³-Refiner, a two-stage optimization framework that raises the reflection judgment score by 12% and the correction score by 9%, and that is compatible across models.
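To make the contrast with single-pass generation concrete, here is a minimal sketch of a generate, reflect, rectify loop. It is an illustration only, not the team's implementation: the function names (`r3_loop`, `generate`, `reflect`, `rectify`), the `Reflection` type, the `max_rounds` budget, and the mapping of the three R³ stages onto these callables are all assumptions, and the "image" is a toy list of words rather than real pixels.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Image = List[str]  # toy stand-in: an "image" is just the list of concepts it depicts


@dataclass
class Reflection:
    ok: bool        # True when the reflection step finds no problem
    feedback: str   # description of the detected problem (empty when ok)


def r3_loop(
    prompt: str,
    generate: Callable[[str], Image],
    reflect: Callable[[str, Image], Reflection],
    rectify: Callable[[str, Image, str], Image],
    max_rounds: int = 3,
) -> Tuple[Image, List[Reflection]]:
    """Generate once, then alternate reflect/rectify until accepted or the budget runs out."""
    image = generate(prompt)  # hypothetical "reason and generate" step
    history: List[Reflection] = []
    for _ in range(max_rounds):
        verdict = reflect(prompt, image)  # identify problems
        history.append(verdict)
        if verdict.ok:
            break
        image = rectify(prompt, image, verdict.feedback)  # attempt a correction
    return image, history


# Toy stand-ins (hypothetical) so the sketch runs end to end.
def toy_generate(prompt: str) -> Image:
    return prompt.split()[:1]  # deliberately incomplete first attempt


def toy_reflect(prompt: str, image: Image) -> Reflection:
    missing = [word for word in prompt.split() if word not in image]
    return Reflection(ok=not missing, feedback=missing[0] if missing else "")


def toy_rectify(prompt: str, image: Image, feedback: str) -> Image:
    return image + [feedback]  # add the concept the reflection step flagged


if __name__ == "__main__":
    final_image, history = r3_loop(
        "red cube blue sphere", toy_generate, toy_reflect, toy_rectify, max_rounds=4
    )
    print(final_image, [r.ok for r in history])
```

In this sketch, the gap that R³-Bench is said to expose corresponds to a `reflect` step that flags a problem followed by a `rectify` step that fails to fix it; the toy `toy_rectify` always succeeds, which a real model would not.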