Section 01
Introduction: ov-impact-bench, an OpenVINO GPU Inference Performance Evaluation Tool for Real-World Testing of LLM Inference on Intel GPUs
ov-impact-bench is a tool for measuring the inference performance of large language models (LLMs) running with OpenVINO on Intel GPUs. It quantifies the actual performance difference between running on the GPU and falling back to the CPU, covering key metrics such as latency, throughput, and energy consumption.
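
To make the three metrics concrete, here is a minimal sketch of how they could be derived from the raw totals of a single run and compared across devices. It is not taken from ov-impact-bench: `RunStats`, `summarize`, `speedup`, and all field names are hypothetical and chosen only for illustration, and the sketch assumes the elapsed time, token count, and energy reading have already been collected by some other means.

```python
from dataclasses import dataclass


@dataclass
class RunStats:
    """Raw totals from one benchmark run (hypothetical structure)."""
    device: str            # e.g. "GPU" or "CPU"
    elapsed_s: float       # wall-clock time for the whole generation, in seconds
    tokens_generated: int  # number of output tokens produced
    energy_j: float        # energy consumed during the run, in joules


def summarize(run: RunStats) -> dict:
    """Derive throughput, per-token latency, and energy per token."""
    return {
        "device": run.device,
        "throughput_tok_s": run.tokens_generated / run.elapsed_s,
        "latency_ms_per_tok": 1000.0 * run.elapsed_s / run.tokens_generated,
        "energy_j_per_tok": run.energy_j / run.tokens_generated,
    }


def speedup(gpu: RunStats, cpu: RunStats) -> float:
    """Ratio of GPU to CPU throughput; a value above 1 means the GPU is faster."""
    gpu_tps = summarize(gpu)["throughput_tok_s"]
    cpu_tps = summarize(cpu)["throughput_tok_s"]
    return gpu_tps / cpu_tps
```

Building one `RunStats` per device for the same model and prompt, then calling `speedup`, gives a single number for the GPU versus CPU fallback gap, while `summarize` keeps the latency and energy figures side by side for each device.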