Section 01
Introduction / Main Floor: HiPER: A Hierarchical Reinforcement Learning Framework with Explicit Credit Assignment for Large Language Model Agents
HiPER is an innovative hierarchical reinforcement learning framework designed specifically for training large language model (LLM) agents to perform tasks in long-horizon environments. By explicitly separating high-level planning from low-level execution and introducing the Hierarchical Advantage Estimation (HAE) mechanism, this framework effectively addresses the credit assignment problem across multiple time scales, achieving leading performance on the ALFWorld and WebShop benchmarks.