Section 01
Airunway: Guide to the Kubernetes-Native Multi-Provider AI Inference Platform
Airunway is an open-source Kubernetes-native AI inference platform designed for multi-provider environments, supporting mainstream inference engines like vLLM, Ray Serve, and NVIDIA Dynamo. It aims to address challenges enterprises face when deploying and managing AI inference workloads—such as unified cross-cloud management, resource scheduling, and cost control—by providing a flexible and efficient inference service solution.