Section 01
Introduction / Main Floor: GPUStack: Open-Source GPU Cluster Manager Making AI Model Deployment as Easy as Using Docker
GPUStack is an open-source GPU cluster management tool that supports inference engines like vLLM, SGLang, and TensorRT-LLM. It offers multi-cluster management capabilities across on-premises, Kubernetes, and cloud environments, with built-in performance optimization, automatic failover, and OpenAI-compatible APIs.