Section 01
DiScO Framework: Enhancing Reasoning Capabilities of Large Language Models via Diverse Thinking Schemata (Introduction)
- This article introduces the DiScO (Diverse Schemata Policy Optimization) framework, which aims to enhance the diversity of thinking schemata through reinforcement learning, improve the performance of large language models on mathematical reasoning tasks, and strengthen their ability to recover from erroneous attempts.
- Source information: Original authors are arXiv authors, source platform is arXiv, original title is "Diverse Thinking Schemata Elicit Better Reasoning in Large Language Models", link: http://arxiv.org/abs/2606.08974v1, publication time: 2026-06-08T03:17:31Z.
- Core value: Reveals scaling diversity as an effective path to enhance model capabilities, providing new ideas for the design of next-generation reasoning models.