Section 01
Introduction: kvcache-sim, a Multi-Tier KV Cache Simulation System for Large Model Inference
kvcache-sim is a KV cache simulator for LLM inference that models a three-tier storage hierarchy of HBM, DRAM, and SSD. It offers three simulation modes: single-node, 10,000-GPU cluster, and prefill/decode (PD) disaggregation. It ships with six built-in eviction policies, including LRU, ARC, and Learned, and can be used to evaluate the cache efficiency and scalability of LLM inference systems.
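To make the three-tier idea concrete, the sketch below shows one way a tiered cache with LRU eviction could behave: a block evicted from a faster tier is demoted to the next slower tier, and a hit in a slower tier promotes the block back to the fastest one. This is an illustrative toy, not kvcache-sim's actual code or API. The names `Tier`, `TieredLRUCache`, and `access`, the capacities, and the demote-on-evict and promote-on-hit rules are all assumptions made for the example.

```python
from collections import OrderedDict


class Tier:
    """One storage tier (hypothetical) with a fixed capacity in blocks."""

    def __init__(self, name, capacity):
        self.name = name
        self.capacity = capacity
        self.blocks = OrderedDict()  # block_id -> None, least recent first


class TieredLRUCache:
    """Toy HBM/DRAM/SSD-style hierarchy with per-tier LRU eviction."""

    def __init__(self, capacities):
        # capacities: list of (tier name, block capacity), fastest tier first
        self.tiers = [Tier(name, cap) for name, cap in capacities]
        self.hits = {name: 0 for name, _ in capacities}
        self.misses = 0

    def _insert(self, level, block_id):
        """Place a block in tier `level`, demoting LRU victims downward."""
        if level >= len(self.tiers):
            return  # evicted from the slowest tier: the block is dropped
        tier = self.tiers[level]
        tier.blocks[block_id] = None
        tier.blocks.move_to_end(block_id)  # mark as most recently used
        if len(tier.blocks) > tier.capacity:
            victim, _ = tier.blocks.popitem(last=False)  # least recent
            self._insert(level + 1, victim)

    def access(self, block_id):
        """Return the name of the tier that served the hit, or None on a miss."""
        for tier in self.tiers:
            if block_id in tier.blocks:
                self.hits[tier.name] += 1
                del tier.blocks[block_id]
                self._insert(0, block_id)  # promote to the fastest tier
                return tier.name
        self.misses += 1
        self._insert(0, block_id)  # admit new blocks into the fastest tier
        return None


cache = TieredLRUCache([("HBM", 2), ("DRAM", 2), ("SSD", 2)])
trace = ["a", "b", "c", "a", "c", "d", "e", "b"]
print([cache.access(block) for block in trace])
# [None, None, None, 'DRAM', 'HBM', None, None, 'SSD']
print(cache.hits, cache.misses)
# {'HBM': 1, 'DRAM': 1, 'SSD': 1} 5
```

Replaying a trace of block IDs and counting hits per tier is the kind of measurement such a simulator can report. A different eviction policy would change only how the victim is chosen in `_insert`.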