Section 01
Introduction: Core Findings on Subgoal Persistence in Hierarchical Latent Reasoning
This paper appeared on arXiv in June 2026 under the original title "When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning". It studies the trade-off in subgoal duration for hierarchical latent reasoning models, that is, how many steps a subgoal should be held before re-planning. Experiments show that a moderate persistence period (P = 3 to 6 steps) performs best: periods that are too short or too long both degrade performance. This finding offers a guiding principle for the design of combinatorial planning systems.
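To make the notion of a persistence period concrete, the following is a minimal sketch, assuming a fixed-interval schedule in which the high-level module selects a new subgoal every P low-level steps. The function names and the string subgoal labels are hypothetical illustrations and are not taken from the paper, whose actual re-planning mechanism may differ.

```python
def replan_steps(total_steps: int, persistence: int) -> list[int]:
    """Step indices at which a new subgoal would be selected.

    Assumption: re-planning happens on a fixed schedule, every
    `persistence` steps, starting at step 0.
    """
    if persistence < 1:
        raise ValueError("persistence must be at least 1")
    return [t for t in range(total_steps) if t % persistence == 0]


def subgoal_trace(total_steps: int, persistence: int) -> list[str]:
    """Label of the active subgoal at each low-level step.

    The labels "g0", "g1", ... are placeholders for whatever latent
    subgoal the high-level module would produce.
    """
    if persistence < 1:
        raise ValueError("persistence must be at least 1")
    trace = []
    subgoal = ""
    for t in range(total_steps):
        if t % persistence == 0:
            # High-level module re-plans: a new subgoal becomes active.
            subgoal = f"g{t // persistence}"
        # Low-level steps in between keep following the same subgoal.
        trace.append(subgoal)
    return trace
```

Under this schedule, P = 1 re-plans at every step, so no subgoal is ever held across steps, while a very large P commits to a single subgoal for the whole rollout. The moderate range reported above sits between these two extremes.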