Section 01
Yale NLP Open-Sources faithful_lrm Framework, Focusing on Evaluating the Faithfulness of Confidence Expressions in Large Reasoning Models
Yale University's NLP Lab has open-sourced the faithful_lrm project, proposing a systematic framework to evaluate whether the confidence expressions of Large Reasoning Models (LRMs) in chain-of-thought reflect their internal uncertainty truthfully, and revealing key challenges in confidence calibration for current reasoning models. The framework aims to enhance the reliability and safety of AI systems.