Section 01
Introduction to the SHAPE Framework: A Training-Free Pruning Solution for MoE Models Based on Shapley Value
SHAPE (SHapley-Aware Pruning of Experts) is a training-free pruning framework for Mixture-of-Experts (MoE) large language models. It corely uses Shapley Value from game theory to quantify the marginal contribution of experts, enabling intelligent expert selection. This framework aims to solve the problems of MoE model size expansion, memory usage, and inference latency, maintaining performance and reducing computational overhead without retraining.