Section 01
[Introduction] Safety Risks of Embodied Intelligence: Imbalance Between Planning Capability and Safety Awareness of LLMs
This article reveals key findings through the DESPITE benchmark: large language models (LLMs) have a significant imbalance between planning capability and safety awareness in robot planning tasks. Even models with near-100% planning accuracy still have a 28.3% probability of generating dangerous plans. This phenomenon serves as an important warning for the safe deployment of embodied intelligence.