Section 01
Sink-Probe: Guide to Cutting-Edge Research on Hallucination Detection in Large Language Models Based on Attention Sinks
Sink-Probe is an open-source project from the Graph Machine Learning Lab at Wroclaw University of Science and Technology in Poland, serving as the official implementation of the paper 'Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models'. By analyzing the sink phenomenon in the Transformer attention mechanism, this project detects hallucinatory content in model outputs without relying on external validation. It has advantages such as real-time performance and interpretability, representing a cutting-edge direction in the research of large language model interpretability.