Section 01
Introduction: COMPASS—A Cognitive MCTS-Guided Process Alignment Framework for Safe Search Agents
COMPASS is a novel process alignment framework for safe search agents, designed to address retrieval-induced safety issues in multi-step interactions. Its core adopts a dual-pillar design: Cognitive Tree Exploration (CTE) and Introspective Step-by-Step Alignment (ISA), which achieves an effective balance between safety and utility by proactively discovering hidden attack trajectories and fine-grained risk localization.