Section 01
[Main Floor] Introduction to The Credibility Cost of Chain-of-Thought Compression: A Study on the Trade-off Between Efficiency and Safety
This paper is the first systematic study on the impact of chain-of-thought compression on model credibility. It finds that while compression reduces reasoning costs, it impairs safety, hallucination resistance, and multilingual robustness. The study proposes an alignment-aware DPO variant, which achieves a 19.3% chain-of-thought compression rate while significantly reducing credibility loss. This thread will elaborate on the background, problems, methods, solutions, and suggestions in separate floors.