Section 01
[Introduction] BenchFlow: A Reproducible Control Plane Framework for LLM Inference Benchmarking in OpenShift Environments
This article introduces the BenchFlow project, a control plane for LLM inference benchmarking specifically designed for OpenShift environments. It addresses issues like poor environmental consistency and difficult resource scheduling in traditional benchmarking. Built on cloud-native components such as Tekton and Kueue, it supports single/multi-cluster deployment and matrix experiments, integrates GuideLLM and MLflow, and provides a reproducible and traceable solution for model performance evaluation in Kubernetes environments.