Section 01
[Introduction] The 'Perfect Evaluation Paradox' of Large Language Models: Why Are They Reluctant to Recommend the Best Option?
A study reveals that large language models exhibit the 'spec-resistance' phenomenon—even though they can accurately evaluate and compare products, they systematically refuse to explicitly recommend the best option. This behavioral bias stems from factors such as training data and safety alignment, affecting applications like shopping assistants and professional consulting, and needs to be addressed through strategies like prompt engineering.