Section 01
Introduction: llm-speedtest-mcp—Zero-Telemetry LLM Inference Speed Benchmark Tool
llm-speedtest-mcp is a lightweight MCP server tool designed to help users perform standardized inference speed tests on multiple LLM providers within local AI tools. It supports measuring key metrics such as TTFT (Time to First Token) and TPS (Tokens Per Second). With less than 500 lines of code, it adheres to the principles of zero telemetry and zero data collection to ensure privacy security. This tool can seamlessly integrate into AI tools that support the MCP protocol, such as Claude Desktop and Cursor, solving the pain point where LLM users struggle to obtain reliable and comparable inference speed data.