Section 01
Introduction: BAS—A New Decision-Theoretic Approach for Confidence Evaluation of Large Language Models
BAS (Behavioral Alignment Score) is a new decision-theoretic metric for LLM confidence evaluation. Addressing the flaw of traditional evaluations that fail to consider "answer or abstain" decisions, it uses an asymmetric penalty mechanism to prioritize avoiding overconfidence errors. The study reveals that cutting-edge models still have severe overconfidence issues, and simple interventions (such as Top-k guidance and post-hoc calibration) can effectively improve reliability, providing a more practical evaluation standard for LLM applications in high-risk scenarios.