Section 01
Introduction: Core Findings of the First Systematic Comparative Study on Hallucination Issues in Diffusion LLMs
The first controlled comparative study on hallucination issues in diffusion large language models (dLLMs) reveals: current dLLMs are more prone to hallucinations than autoregressive (AR) models of the same scale, and have diffusion-specific failure modes such as early termination and incomplete denoising. This study fills the gap in research on dLLM faithfulness and provides directions for optimizing model reliability.