Section 01
Introduction to the Eureka Algorithm: LLM-Driven Automated Design of Reinforcement Learning Reward Functions
The Eureka algorithm uses the code generation and reasoning capabilities of large language models (LLMs) to recast reward function design for reinforcement learning as a code generation task. This shifts reward design from manual work by human experts to autonomous design by an AI system, and it addresses the main bottlenecks of the traditional approach: manual design is time-consuming, labor-intensive, and hard to scale to complex tasks.
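To make "reward design as code generation" concrete, the sketch below treats each candidate reward function as a string of Python source, compiles it, and scores it on a toy task. It is an illustration under stated assumptions, not Eureka's actual implementation: `propose_reward_sources` is a hypothetical stand-in for an LLM call that returns fixed strings, and the 1-D "move to the target" task with a greedy agent is an invented proxy for real reinforcement learning training.

```python
from typing import Callable, Dict, List

State = Dict[str, float]
RewardFn = Callable[[State], float]


def propose_reward_sources() -> List[str]:
    """Hypothetical stand-in for an LLM call: returns reward functions as source code."""
    dense = (
        "def reward(state):\n"
        "    return -abs(state['target'] - state['position'])\n"
    )
    sparse = (
        "def reward(state):\n"
        "    close = abs(state['target'] - state['position']) < 0.1\n"
        "    return 1.0 if close else 0.0\n"
    )
    return [dense, sparse]


def compile_reward(source: str) -> RewardFn:
    """Turn generated source text into a callable.

    exec() runs arbitrary code, so this is only acceptable for trusted text;
    generated code would need sandboxing in any real system.
    """
    namespace: dict = {}
    exec(source, namespace)
    return namespace["reward"]


def score_candidate(reward_fn: RewardFn, steps: int = 8) -> float:
    """Toy proxy for training: a greedy agent on a line follows the reward.

    Returns the negative final distance to the target (higher is better).
    """
    position, target = 0.0, 1.0
    actions = (-0.25, 0.0, 0.25)
    for _ in range(steps):
        # Pick the action whose resulting state the candidate reward rates highest.
        best = max(
            actions,
            key=lambda a: reward_fn({"position": position + a, "target": target}),
        )
        position += best
    return -abs(target - position)


sources = propose_reward_sources()
scores = [score_candidate(compile_reward(src)) for src in sources]
best_index = max(range(len(scores)), key=scores.__getitem__)
print("scores:", scores)
print("selected reward function:\n" + sources[best_index])
```

In this toy setup the dense, distance-based candidate guides the greedy agent to the target, while the sparse candidate gives no signal away from the target and the agent drifts. The point is only that once rewards are expressed as code, candidates can be generated, executed, and compared automatically.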