Section 01
[Introduction] tps.sh: Core Overview of the Performance Benchmarking Tool for Local and Cloud LLMs
tps.sh is an open-source tool dedicated to performance testing of large language models (LLMs). Its core goal is to compare the tokens per second (TPS) performance of local Ollama models and the cloud-based Claude API through 147 tests, helping users make optimal deployment decisions on Apple Silicon devices. This tool eliminates technical barriers to performance evaluation by encapsulating complex testing logic into a concise command-line interface, supporting cross-platform operation, and providing data-driven references for LLM deployment to developers and users.