Section 01
SinkProbe: A New Method for LLM Hallucination Detection Using Attention Sinks
A research team from Wroclaw University of Science and Technology in Poland has proposed SinkProbe, a method that detects hallucinated content by analyzing the attention sinks inside large language models. The method requires no external references: it relies only on statistical features of the attention matrices, which keeps detection efficient. It is reported to perform well across multiple models and datasets. The related paper will be published at ICML 2026.
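To make "statistical features of attention matrices" concrete, here is a minimal Python sketch. It is not the SinkProbe implementation, whose exact features this summary does not describe. It assumes the common description of an attention sink as a token (often the first one) that receives a disproportionate share of attention, and it summarizes that share per head. The function names `causal_softmax` and `sink_features`, the choice of position 0 as the sink, and the three statistics are all illustrative assumptions.

```python
import numpy as np


def causal_softmax(scores):
    """Row-wise softmax under a causal mask; scores has shape (heads, seq, seq)."""
    seq = scores.shape[-1]
    mask = np.tril(np.ones((seq, seq), dtype=bool))
    masked = np.where(mask, scores, -np.inf)
    # Subtract the row maximum for numerical stability before exponentiating.
    masked = masked - masked.max(axis=-1, keepdims=True)
    weights = np.exp(masked)
    return weights / weights.sum(axis=-1, keepdims=True)


def sink_features(attn, sink_index=0):
    """Summarize the attention mass each head sends to a presumed sink token.

    Hypothetical feature set: mean, standard deviation, and maximum across
    heads. The features used in the paper may differ.
    """
    # Average, per head, the attention that later queries place on the sink.
    per_head = attn[:, sink_index + 1:, sink_index].mean(axis=1)
    return np.array([per_head.mean(), per_head.std(), per_head.max()])


# Synthetic attention scores: 4 heads, 6 tokens, biased toward token 0
# to mimic a sink. Real use would read attention weights from a model.
rng = np.random.default_rng(0)
scores = rng.normal(size=(4, 6, 6))
scores[:, :, 0] += 3.0
attn = causal_softmax(scores)
features = sink_features(attn)
```

In a detector along these lines, a feature vector like `features` would be computed per layer or per response and passed to a lightweight classifier trained on labeled examples. That last step is also an assumption and is left out here.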