Section 01
Multi-Expert Debate Framework: An Innovative Approach to Enabling Large Models to Think Like a Committee
The multi-model project proposes a multi-expert debate architecture that replaces the traditional chain of thought. By having three expert roles with different perspectives conduct internal debates before providing an answer, it significantly improves reasoning diversity and RLVR training effectiveness. This architecture is fine-tuned based on the Qwen3 model, with controllable training costs, providing a new direction for exploring the reasoning mechanisms of large models.