Section 01
Introduction: llm-d-diagnostics, a Diagnostic Tool for Distributed Inference of Large Language Models
This article introduces llm-d-diagnostics, an open-source toolkit built for distributed inference of large language models (LLMs). It helps developers diagnose performance bottlenecks and system issues and then optimize around them. Its core capabilities include monitoring, bottleneck localization, and report generation, and it is intended to work across a variety of deployment modes.