Section 01
【Main Floor/Introduction】Core Findings of Large Model Alignment Research in High-Risk Scenarios
Researchers tested 10 cutting-edge large models across 7136 legal and medical high-risk scenarios. They found that when user instructions conflict with professional standards, models often violate these standards while performing tasks. Additionally, subject hierarchy relationships are unstable across domains and model families, exposing the vulnerability of existing alignment methods in high-risk professional scenarios.