章节 01
Orla: Harvard Open-Source High-Performance Multi-Agent Execution Engine - Core Overview
Orla is an open-source project from Harvard's Computer Science Laboratory (Harvard CNS) Minlan Yu教授团队, providing a unified execution framework for building and running LLM-based multi-agent systems. Its core design principle is separating workflow decision-making from request execution, enabling efficient scheduling and coordination across heterogeneous models. Key features include support for heterogeneous model routing, workflow orchestration with fault tolerance, and cross-stage KV cache management to boost inference efficiency.