Section 01
tps.sh: Guide to Performance Benchmarking Tool for Local and Cloud LLMs
tps.sh is an open-source performance testing tool for large language models, designed specifically for Apple Silicon Macs and supporting Windows platforms. By measuring tokens per second (TPS), output quality, and cost, it helps users compare performance differences between locally deployed models (e.g., Ollama) and cloud API services (e.g., Claude), assisting in choosing the most suitable model solution. The tool covers 147 tests, 7 models, and 21 sample prompts, lowering the technical barrier for LLM performance evaluation.