Reading

From Intent to Execution: A Multi-Agent Workflow Auto-Orchestration Framework Based on Agent Recommendation

This article introduces an automated multi-agent system construction framework. Through an LLM-driven planner, dynamic call graph, and two-stage agent recommendation system, it transforms manual workflow orchestration into an automated process, significantly improving the recall rate of agent selection and system robustness.

多智能体系统智能体推荐工作流编排LLM规划信息检索自动化框架任务分解

Published 2026-05-06 01:08Recent activity 2026-05-06 11:52Estimated read 7 min

From Intent to Execution: A Multi-Agent Workflow Auto-Orchestration Framework Based on Agent Recommendation

Section 01

Introduction: Multi-Agent Workflow Auto-Orchestration Framework Based on Agent Recommendation

This article proposes an automated multi-agent system construction framework to address the pain points of manual construction of current multi-agent systems (MAS), including manual planning, complex agent selection, and tedious execution graph construction. Through core modules such as an LLM-driven planner, dynamic call graph, and two-stage agent recommendation system, the framework transforms manual workflow orchestration into an automated process, significantly improving the recall rate of agent selection and system robustness, and promoting a key shift of MAS from manual crafting to automated assembly.

Section 02

Three Major Dilemmas in Multi-Agent System Construction

Current multi-agent system development faces three challenges:

Manual Planning: Requires manual design of execution plans, prediction of step inputs/outputs, and handling of edge cases—time-consuming, error-prone, and difficult to adapt to demand changes;
Complex Agent Selection: Rapid growth in the number of agents, each with capability boundaries, performance characteristics, and costs, leading to heavy manual evaluation and selection burdens;
Tedious Execution Graph Construction: Assembling call graphs requires a large amount of boilerplate code (parameter mapping, error handling, etc.), occupying significant development time.

Section 03

Analysis of Five Core Modules of the Automated Framework

The framework includes five closely collaborative modules:

LLM-Driven Planner: Receives natural language intent, outputs a set of structured task descriptions, and dynamically generates task decomposition and input/output specifications adaptively;
Natural Language Task Description: Expresses task goals, constraints, etc., with rich semantics, supporting semantic similarity matching;
Dynamic Call Graph: Explicitly represents task dependencies and can adjust execution paths (e.g., branches, loops) based on runtime conditions;
Agent Orchestrator: Maintains an agent registry and maps tasks to agents based on factors such as capability matching degree and historical success rate;
Two-Stage Agent Recommendation System: Fast retrieval (embedding model vector matching to filter candidates) + LLM reordering (fine-grained semantic matching to rank), balancing efficiency and effectiveness.

Section 04

Experimental Exploration: Component Optimization and Key Findings

The research team optimized the framework components through experiments:

Embedding Model Selection: Domain-specific embedding models (e.g., code/tool description fine-tuned models) outperform general models;
Reordering Strategy: LLM-generated reasoning (Chain-of-Thought) improves accuracy and interpretability, while introducing negative samples enhances discrimination ability;
Agent Description Enhancement: Analyzing historical logs to extract successful/failed scenarios to enrich descriptions, improving matching recall rate;
Critic Agent: Reviews recommendation results from dimensions of task coverage, redundancy, risk diversification, and cost-effectiveness to further improve recall rate.

Section 05

End-to-End Test Results: Validation of Recall Rate and Robustness

End-to-end benchmark tests cover scenarios such as data analysis and code generation, with results showing:

Significant Recall Rate Improvement: Compared to existing methods, the recall rate of agent selection is significantly higher;
Robustness and Scalability: Response time remains sub-second as the number of agents grows, and selection quality is stable under task description variations/noise, making it suitable for production deployment.

Section 06

Industry Implications and Future Outlook

Industry Implications:

Need to establish standardized specifications for agent capability descriptions;
Developer role shifts from "writing call code" to "defining task intent";
New human-machine collaboration model: The framework handles matching and orchestration, while humans focus on intent clarification and quality control.

Future Outlook: The auto-orchestration framework will become a key component of AI infrastructure, and design ideas such as two-stage recommendation and dynamic call graph may become industry standard practices.

From Intent to Execution: A Multi-Agent Workflow Auto-Orchestration Framework Based on Agent Recommendation

Introduction: Multi-Agent Workflow Auto-Orchestration Framework Based on Agent Recommendation

Three Major Dilemmas in Multi-Agent System Construction

Analysis of Five Core Modules of the Automated Framework

Experimental Exploration: Component Optimization and Key Findings

End-to-End Test Results: Validation of Recall Rate and Robustness

Industry Implications and Future Outlook

Continue Reading

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Folkering OS: When the Operating System Itself Is AI—A Self-Evolving Bare-Metal Rust System

LLM-assisted-analysis: A New Approach to Detecting Logical Vulnerabilities in Smart Contracts Using Large Language Models

Building Modern LLM from Scratch: A Tutorial-level Implementation of Llama-style Language Model