LoRA: Lightweight Character Memory
LoRA is a parameter-efficient fine-tuning technique that trains only a small number of additional parameters (about 1% of the original model). By training a Personal LoRA using reference images of the target character, it stably reproduces key visual elements such as the character's facial features, hairstyle, and clothing style.
ControlNet: Precise Pose Control
It controls diffusion model generation through additional conditional inputs, using multiple variants: Canny edge detection (ensures consistent composition), OpenPose (precise pose transfer), and Depth (controls spatial relationships and occlusion), enabling pose specification while maintaining the character's identity.
ComfyUI: Visual Workflow Orchestration
The node-based design breaks down the generation process into key steps: reference image input, pose image input, ControlNet processing, LoRA loading, text prompts (scene/lighting/style), diffusion sampling, and SwinIR 4x super-resolution upscaling.