Section 01
Introduction: Near-Miss Crisis in Agent Workflows and Detection Methods
This article examines the "near-miss" phenomenon in LLM agent workflows: latent failures in which the final result is correct but the decision process bypassed key policy checks. It exposes the blind spot of traditional evaluations that score only final results, proposes a detection method based on the ToolGuard framework, and reports experiments in which 8-17% of correct results carry near-miss risk. The conclusion is that agent systems need to shift their focus from "correct results" to "trustworthy processes".
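To make the definition concrete, the following is a minimal sketch of how a near-miss could be separated from a clean success. It is an illustration only and does not use or reproduce the ToolGuard API: the `Trace` type, the `classify` and `near_miss_rate` functions, and the tool names (`verify_identity`, `issue_refund`) are all hypothetical, and a "check" is assumed to be simply a tool call that must appear somewhere in the run.

```python
from dataclasses import dataclass


@dataclass
class Trace:
    """One agent run (hypothetical shape): tool calls made, and whether the final answer was correct."""
    tool_calls: list[str]
    result_correct: bool


def classify(trace: Trace, required_checks: set[str]) -> str:
    """Label a run as 'failure', 'near_miss', or 'clean_success'.

    Assumption: a required check counts as performed if a tool call
    with that name appears anywhere in the trace.
    """
    if not trace.result_correct:
        return "failure"
    skipped = required_checks - set(trace.tool_calls)
    # Correct result, but at least one required check never ran.
    return "near_miss" if skipped else "clean_success"


def near_miss_rate(traces: list[Trace], required_checks: set[str]) -> float:
    """Fraction of correct-result runs that are near-misses."""
    correct = [t for t in traces if t.result_correct]
    if not correct:
        return 0.0
    misses = sum(classify(t, required_checks) == "near_miss" for t in correct)
    return misses / len(correct)


if __name__ == "__main__":
    checks = {"verify_identity"}  # hypothetical policy check
    runs = [
        Trace(["verify_identity", "issue_refund"], result_correct=True),
        Trace(["issue_refund"], result_correct=True),   # right answer, check skipped
        Trace(["issue_refund"], result_correct=False),
    ]
    for run in runs:
        print(classify(run, checks))
    print(near_miss_rate(runs, checks))
```

An evaluation that scores only `result_correct` would count the first two runs as equally good; looking at the trace is what distinguishes them.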