Section 01
Introduction: SM-Bench — A Benchmark Exposing the 'Security Theater' of Large Models
SM-Bench (Safetymaxxed Bench) is a benchmark tool that quantifies over-compliance issues in large models' security mechanisms. It aims to reveal the 'security theater' phenomenon: overly sensitive security filters set by models to demonstrate compliance, which sacrifice common-sense reasoning and user experience. This article will discuss SM-Bench's background, testing methods, significance of results, and improvement directions.