Section 01
SycoPrism Project Guide: A Comprehensive Tool to Examine the Flattery Trap of LLMs
SycoPrism is a benchmark framework for evaluating flattery (sycophancy) in large language models (LLMs). Its core contributions are the Tri-facet Prism Evaluation Framework, 3,100 test cases, a lightweight 8B-parameter reward model, and a systematic evaluation methodology. The goal is to diagnose and quantify flattery in LLMs so that AI systems can be made more reliable and fair.