Section 01
Citation Dilemma of LLM Deep Research Agents: Cited but Unverified (Introduction)
This article focuses on the citation reliability issue of LLM-driven deep research agents. The first systematic evaluation framework reveals: even the strongest models have a factual accuracy rate of only 39-77%, and more retrieval does not mean more accurate citations. This article will discuss from the background, evaluation framework, findings, analysis, and future directions.