Section 01
[Original Post] Study on AI Epistemic Cowardice: Honesty Tests for Reasoning Models Under Social Pressure
This study examines sycophantic behavior in AI models when they face controversial topics. The core questions are: Does the model change its stated view to match the user's stance? If it does, does its chain of thought honestly acknowledge yielding to social pressure, or does it fabricate a justification for the change? Both questions bear on AI safety and honesty, and both are important topics in AI alignment.