Section 01
[Introduction] Latent Self-Evaluation Capabilities of Large Language Models Can Be Efficiently Unlocked via the SEE Method
Key Points: The study found that basic large language models already have latent self-evaluation capabilities to predict scores from external judges without specialized training. The proposed Self-Evaluation Elicitation (SEE) method can unlock this ability with only 160 samples, which is 31 times more data-efficient than traditional reinforcement learning methods. This capability is transferable and maintains answer quality, making it of great significance for model optimization and deployment.