Section 01
[Introduction] KServe: Core Overview of the Standardized AI Inference Platform on Kubernetes
KServe is an open-source AI inference platform and a Cloud Native Computing Foundation (CNCF) incubating project. It aims to provide a unified, standardized way to serve models on Kubernetes and supports two types of workloads: generative AI (such as large language models) and predictive AI (traditional machine learning models). It addresses the infrastructure challenges enterprises face when deploying AI inference services on Kubernetes, such as multi-framework support, autoscaling, and GPU optimization, and it has been used in production by enterprises in finance, technology, manufacturing, and other industries.