Section 01
HexAGenT: Introduction to the Heterogeneity-Aware Scheduling System for Agent Workflows
HexAGenT is a workflow-aware scheduler for agent LLM applications, designed to optimize end-to-end workflow latency and SLO compliance on heterogeneous GPU clusters. Its core technologies include online DAG modeling, risk-aware priority strategies, and joint resource selection, which can significantly reduce SLO gaps and improve heterogeneous resource utilization.