Section 01
Introduction: InterleaveThinker—Empowering Interleaved Text-Image Generation with Multi-Agent Reinforcement Learning
This paper proposes InterleaveThinker, a multi-agent pipeline that enables existing image generators to perform interleaved text-image generation through collaboration between a planner agent and a critic agent. It uses GRPO reinforcement learning for step-level instruction correction and achieves performance comparable to Nano Banana and GPT-5 on interleaved generation benchmarks, while significantly improving performance on reasoning benchmarks. The work was published on arXiv in June 2026 under the original title "InterleaveThinker: Reinforcing Agentic Interleaved Generation".
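To make the GRPO reference concrete, here is a minimal sketch of the group-relative advantage computation that gives GRPO its name: several candidates are sampled for the same input, and each reward is normalized against the mean and standard deviation of its own group. This is the generic formulation, not necessarily the exact variant used in the paper. The function name, the `eps` constant, and the example scores are illustrative assumptions.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group (GRPO-style).

    `rewards` holds one scalar score per candidate sampled for the same
    input. `eps` is an assumed small constant that avoids division by
    zero when every candidate receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation of the group
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical scores for four candidate corrections of a single step.
scores = [0.2, 0.8, 0.5, 0.5]
advantages = group_relative_advantages(scores)
```

Candidates that score above the group mean receive a positive advantage and are reinforced, while those below the mean are discouraged. No separate value network is needed to estimate a baseline.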