Section 01
【Introduction】Flow-OPD: Empowering Image Generation Models with LLM Policy Distillation Technology
Researchers applied the successful On-Policy Distillation (OPD) technology from the Large Language Model (LLM) field to Flow Matching image generation models, proposing the Flow-OPD framework. This framework addresses two core issues faced by Flow Matching models during the fine-tuning alignment phase: sparse rewards and gradient interference, and achieves significant performance improvements on Stable Diffusion 3.5, providing a new paradigm for multi-task alignment of image generation models.