Section 01
DUME: Guide to the New MoE Method for Dynamic Expert Model Recombination Without Training
Core Guide to DUME
DUME (Dynamic Upcycling MoE) is a new MoE method that dynamically recombines multi-domain expert models without additional training. It achieves expert integration via the closed-form solution of ridge regression, maintaining 97.6% of the original experts' performance while supporting dynamic addition of new experts, solving the cost and efficiency challenges of multi-domain expert integration.
This article will discuss aspects such as background, technical solution, performance verification, dynamic expansion, and application prospects.