FlowBank: Query-Adaptive Agent Workflow Optimization via Precomputation and Reuse

FlowBank proposes a new paradigm for agent workflow optimization. By building a reusable workflow library and adaptively selecting workflows during inference, it addresses the dilemma between offline computation and inference costs faced by traditional methods.

Tags: agent workflows, large language models, multi-agent systems, query-adaptive, workflow optimization, precomputation, FlowBank
Published 2026-06-10 01:58 | Recent activity 2026-06-11 09:48 | Estimated read 5 min
Section 01

FlowBank: A New Paradigm for Agent Workflow Optimization

FlowBank treats task-level and query-level workflow optimization as complementary rather than competing paradigms. It builds a compact library of complementary, reusable workflows offline and then adaptively selects a workflow from that library for each query at inference time. The aim is to balance performance and cost, resolving the dilemma between offline computation and inference cost that traditional methods face.

Section 02

Background: The Dilemma of Agent Workflow Optimization

Multi-agent systems built on large language models are powerful, but current approaches to workflow optimization force a trade-off. Task-level methods search offline for an optimal workflow but waste the complementary candidates found along the way. Query-level methods synthesize a workflow from scratch for every query, which makes inference extremely expensive. In essence, one must choose between the high cost of offline precomputation and the inference cost of per-query generation.

Section 03

Core Insight: Complementary Rather Than Competitive Optimization Approach

The FlowBank team found that the task-level and query-level paradigms are complementary. First, the workflows produced by offline search cover different subsets of queries, so no single "optimal" workflow meets every need. Second, many queries can be solved by a precomputed workflow, without resorting to expensive query-level generation. They therefore propose a new goal: build a compact, reusable library of complementary workflows and adaptively select among them at inference time.
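The coverage argument can be made concrete with a toy example. The sketch below is illustrative only: the workflow names and the per-query success table are invented for this example and are not data from FlowBank. It compares the accuracy of the single best workflow with the accuracy an ideal selector could reach if it always picked a workflow from the library that solves the query.

```python
# Hypothetical success table (invented for illustration):
# solved[w][q] is True if workflow w solves query q.
solved = {
    "wf_a": [True, True, False, False, True, False],
    "wf_b": [False, True, True, False, False, True],
    "wf_c": [True, False, False, True, False, False],
}


def accuracy(flags):
    """Fraction of queries marked as solved."""
    return sum(flags) / len(flags)


def best_single(table):
    """Return (name, accuracy) of the single workflow with the highest accuracy."""
    name = max(table, key=lambda w: accuracy(table[w]))
    return name, accuracy(table[name])


def oracle_library_accuracy(table):
    """Accuracy of an ideal selector: a query counts as solved
    if at least one workflow in the library solves it."""
    n_queries = len(next(iter(table.values())))
    covered = [any(table[w][q] for w in table) for q in range(n_queries)]
    return accuracy(covered)


print(best_single(solved))              # ('wf_a', 0.5)
print(oracle_library_accuracy(solved))  # 1.0
```

In this toy table no workflow solves more than half of the queries, yet together the three solve all of them. The gap between the two numbers is the headroom that a good selection step could recover in principle.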

Section 04

FlowBank Framework: A Three-Stage Solution

FlowBank addresses the problem in three stages:

1. Diversification: the DiverseFlow algorithm generates diverse candidate workflows that cover a wide range of query types.
2. Curation: the CuraFlow algorithm compresses the candidates into a compact set, minimizing redundancy while retaining coverage.
3. Matching: query-workflow assignment is modeled as edge value prediction, and each query is routed to the workflow with the highest expected utility.
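The curation and matching stages can be sketched in a few lines. This summary does not describe the internals of CuraFlow or of the edge value predictor, so the code below substitutes well-known stand-ins: a greedy maximum-coverage heuristic for curation and a simple argmax over a score function for routing. The function names (`curate_greedy`, `route`), the coverage sets, and the score table are all hypothetical and exist only for illustration.

```python
from typing import Callable, Dict, Hashable, List, Set


def curate_greedy(coverage: Dict[str, Set[int]], budget: int) -> List[str]:
    """Greedy max-coverage stand-in for curation (not CuraFlow itself).

    coverage maps a workflow name to the set of query ids it solves.
    Each round adds the workflow that covers the most still-uncovered queries.
    """
    chosen: List[str] = []
    covered: Set[int] = set()
    for _ in range(budget):
        best = max(
            (w for w in coverage if w not in chosen),
            key=lambda w: len(coverage[w] - covered),
            default=None,
        )
        # Stop early once no remaining workflow adds new coverage.
        if best is None or not (coverage[best] - covered):
            break
        chosen.append(best)
        covered |= coverage[best]
    return chosen


def route(query: Hashable, library: List[str],
          edge_value: Callable[[Hashable, str], float]) -> str:
    """Send the query to the workflow with the highest predicted edge value."""
    return max(library, key=lambda w: edge_value(query, w))


# Hypothetical candidates; wf_d is redundant because wf_a already covers it.
candidates = {
    "wf_a": {0, 1, 4},
    "wf_b": {1, 2, 5},
    "wf_c": {0, 3},
    "wf_d": {0, 1},
}
library = curate_greedy(candidates, budget=3)
print(library)  # ['wf_a', 'wf_b', 'wf_c']

# Hypothetical predictor: a lookup table standing in for a learned model.
scores = {("q_math", "wf_a"): 0.9, ("q_math", "wf_b"): 0.4,
          ("q_code", "wf_a"): 0.2, ("q_code", "wf_b"): 0.7}


def edge_value(q, w):
    return scores.get((q, w), 0.0)


print(route("q_math", library, edge_value))  # wf_a
print(route("q_code", library, edge_value))  # wf_b
```

The greedy heuristic drops `wf_d` because it adds no coverage beyond `wf_a`, which mirrors the stated goal of minimizing redundancy while retaining coverage. In a real system the lookup table would be replaced by a learned predictor over query-workflow pairs.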

Section 05

Experimental Validation: Performance of FlowBank

Across five benchmarks, FlowBank achieved the highest average score while remaining competitive on cost. It improved on the strongest automated baseline by 4.26% and on the strongest manually designed baseline by 14.92%, easing the traditional trade-off between performance and cost.

Section 06

Practical Significance and Application Prospects

- For system architects: an efficient, flexible framework for agent systems, in which the precomputed library reduces inference cost.
- For researchers: a new direction in optimizing combinations of workflows.
- For practical applications: lower operating costs and more consistent service in scenarios such as customer service and code generation.

Section 07

Summary and Outlook

FlowBank rethinks workflow optimization by combining offline precomputation with adaptive selection at inference time. Its core innovation is a library of complementary workflows together with intelligent per-query selection among them. The same idea may inspire other AI scenarios that must balance precomputation against flexibility. Future work could explore more advanced diversification strategies, compression algorithms, and routing models.