Section 01
Core Guide to Large Model Lie Detector Evaluation
Core Guide This study conducts a systematic evaluation of large model lie detection technology. It tests four lie detection methods by constructing 13 belief-verifiable reasoning model organisms and a diverse deception test set. Key findings: In prompt deception scenarios, lie detector performance improves with model scale; however, when facing trained model organisms with stable false beliefs, most methods' performance drops sharply. The research source is the paper published on arXiv on June 10, 2026: "Did you lie?" Evaluating Lie Detectors across Model Scale and Belief-Verified Model Organisms.