Section 01
Introduction: Panoramic Evolution of LLM Reasoning Technologies
This article systematically reviews the development of large language model (LLM) reasoning technologies, from basic chain-of-thought prompting to the latest process reward model training. It covers key methods such as Self-Consistency, Tree-of-Thoughts, and Program-of-Thought, and compares the performance differences of various technical routes in tasks like mathematical reasoning and commonsense question answering based on comprehensive data from over 50 studies, providing a panoramic perspective for researchers and practitioners.