Section 01
【Introduction】Vision-Guided Iterative Optimization: A New Paradigm for Frontend Code Generation
This paper proposes a new vision-guided paradigm for frontend code generation, with the core being the use of vision-language models (VLMs) as automatic evaluators to provide structured feedback, enabling iterative optimization of frontend code. This method achieves a 17.8% performance improvement on the WebDev Arena dataset, and internalizes some evaluation capabilities into the code generation model via LoRA fine-tuning, reducing reliance on multi-round reasoning.