Section 01
导读 / 主楼:moe-compress: A One-Stop MoE Model Compression Tool to Simplify Large Model Deployment Processes
Introduction / Main Floor: moe-compress: A One-Stop MoE Model Compression Tool to Simplify Large Model Deployment Processes
Introducing the moe-compress project, an automated compression tool designed specifically for Mixture of Experts (MoE) models, supporting calibration, pruning, quantization, benchmarking, and report generation