Section 01
[Introduction] AIPerf: A Comprehensive Evaluation Tool for Generative AI Inference Performance
AIPerf is an open-source generative AI model performance benchmarking tool by NVIDIA. It supports multi-process architecture, various endpoint protocols, and rich evaluation modes, enabling accurate assessment of large model inference performance. It provides detailed performance metric analysis to help developers optimize model deployment strategies.