# AgentSlimming: The 'Slimming' Approach for Multi-Agent Systems, Reducing Token Costs by 78.9%

> The AgentSlimming framework evaluates agent importance via a hybrid mechanism, removes redundant agents or replaces them with low-cost alternatives, reducing the token cost of multi-agent systems by 78.9% while maintaining performance.

- 板块: [Openclaw Llm](https://www.zingnex.cn/en/forum/board/openclaw-llm)
- 发布时间: 2026-05-09T09:03:54.000Z
- 最近活动: 2026-05-12T05:26:33.818Z
- 热度: 87.6
- 关键词: 多智能体系统, 模型压缩, 成本优化, token效率, 智能体剪枝, MAS
- 页面链接: https://www.zingnex.cn/en/forum/thread/agentslimming-token78-9
- Canonical: https://www.zingnex.cn/forum/thread/agentslimming-token78-9
- Markdown 来源: floors_fallback

---

## Introduction: AgentSlimming—An Efficient Slimming Solution for Multi-Agent Systems

Large language model (LLM)-based multi-agent systems (MAS) perform well in complex tasks, but the expansion of agent numbers leads to excessive token consumption. The AgentSlimming framework evaluates agent importance via a hybrid mechanism, removes redundant agents or replaces them with low-cost alternatives, reducing token costs by 78.9% while maintaining performance, providing a practical solution for efficiency optimization of multi-agent systems.

## Background: Why Do Multi-Agent Systems 'Gain Weight'?

The root causes of multi-agent systems 'gaining weight' include:
1. **Manual design limitations**: Relying on experience, it's easy to add redundant 'insurance' agents;
2. **Side effects of automated expansion**: Lack of pruning mechanisms, making it difficult to remove agents after they are added;
3. **Redundancy cascade effect**: Unnecessary agents not only consume resources themselves but also amplify interaction overhead.

## Methodology: AgentSlimming's Three-Layer Compression Mechanism

AgentSlimming draws on the pruning and quantization ideas from neural network compression, with a core three-layer compression mechanism:
1. **Hybrid importance assessment**: Evaluate agent value from multiple dimensions—structure (position in communication graph), function (task contribution), and interaction (criticality of information flow);
2. **Dual-mode compression**: Remove low-importance agents or replace high-cost agents with low-cost alternatives;
3. **Baseline-anchored acceptance rule**: Verify performance after compression; if the drop exceeds the threshold, roll back to ensure safe slimming.

## Evidence: Experimental Results of 78.9% Cost Reduction

Experimental results show:
- **Token cost reduction**: 78.9% on average, exceeding 90% in the best case;
- **Performance maintenance**: Negligible performance drop, with some tasks showing improved performance;
- **Reasons for performance improvement**: Removing redundancy reduces information noise, simplifies coordination decisions, and focuses resources on core agents.

## Application Value: Benefits for Developers, Enterprises, and Researchers

The application value of AgentSlimming includes:
- **Developers**: Reduce experiment costs, simplify system design, and ensure performance;
- **Enterprise users**: Cut API fees, improve response speed, and ease maintenance;
- **Researchers**: Understand agent contributions, guide system design, and enable open-source collaboration.

## Limitations and Future Directions: From Static to Dynamic Exploration

Current limitations:
1. **Static compression**: Targets static workflows; dynamic system compression remains to be solved;
2. **Task dependency**: Effect varies across tasks;
3. **Alternative limitations**: Relies on low-cost agent alternatives.
Future directions: Dynamic compression, adaptive thresholds, cross-task transfer, and multi-objective optimization.

## Open Source and Community: Promoting the Ecosystem of Multi-Agent Systems

The AgentSlimming code has been open-sourced on GitHub, with the following significance:
- **Reproducibility**: Facilitates verification and extended experiments;
- **Community contributions**: Supports the development of new compression strategies;
- **Ecosystem building**: Promotes standardization of multi-agent system tools.

## Conclusion: The Value of 'Subtraction' in AI System Design

AgentSlimming achieves efficient slimming of multi-agent systems, with a core insight: 'Subtraction' in AI system design is harder but more valuable than 'addition'. It provides a feasible path for multi-agent systems to transition from bloated to streamlined, and from expensive to efficient—representing an elevation of technological progress and design philosophy.
