Section 01
RAG-Driven Image Editing with Prompt Decomposition: Integrating Multimodal LLMs and Diffusion Models (Introduction)
This article introduces an image editing method based on Retrieval-Augmented Generation (RAG). It combines prompt decomposition, FAISS vector retrieval, and diffusion models to perform context-aware image editing. By bringing RAG into the editing pipeline, the method addresses a core challenge in existing text-driven image editing: preserving the key features of the original image while accurately carrying out the editing instruction. In doing so, it offers a new technical paradigm for the AIGC field. Project source: original author bidisha1005; platform GitHub; original title prompt_controlled_image_editing; link https://github.com/bidisha1005/prompt_controlled_image_editing; release time 2026-05-25T16:26:08Z.
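To make the idea of "decompose the prompt, then retrieve context for each part" more concrete, the following is a minimal sketch and not the project's actual code. It assumes a naive rule-based splitter in place of a multimodal LLM, and a toy bag-of-words cosine search in place of FAISS with learned embeddings; the diffusion stage is omitted entirely. All names (`decompose`, `retrieve`, `build_edit_plan`, `SCENE_FACTS`) are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical context store: short descriptions of the original image.
# A real system would hold learned embeddings in a vector index instead.
SCENE_FACTS = [
    "sky region: blue sky with clouds at the top of the image",
    "subject: a person standing in the center, wearing no hat",
    "background: green grass field",
]


def decompose(prompt):
    """Split a compound instruction into atomic sub-instructions.

    Naive rule-based stand-in for LLM-driven prompt decomposition.
    """
    parts = re.split(r"\s*(?:,|;|\band\b|\bthen\b)\s*", prompt.strip())
    return [p for p in parts if p]


def embed(text):
    """Toy bag-of-words vector (assumption: stands in for a text encoder)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=1):
    """Return the k corpus entries most similar to the query."""
    q = embed(query)
    ranked = sorted(corpus, key=lambda doc: cosine(q, embed(doc)), reverse=True)
    return ranked[:k]


def build_edit_plan(prompt, corpus):
    """Pair each sub-instruction with its best-matching context entry."""
    return [(step, retrieve(step, corpus, k=1)[0]) for step in decompose(prompt)]


if __name__ == "__main__":
    plan = build_edit_plan("make the sky purple and add a red hat", SCENE_FACTS)
    for step, context in plan:
        print(f"{step!r} -> {context!r}")
```

In this sketch each sub-instruction is paired with the scene description it most resembles, which is the kind of context an editing stage could use to leave unrelated regions untouched. Swapping the toy search for a real vector index and the splitter for an LLM call would change the quality of the matches but not the overall shape of the flow.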