Section 01
Spark-Stack Overview: A Local LLM Inference Monitoring Dashboard for NVIDIA DGX Spark
Spark-Stack is an open-source monitoring tool designed specifically for local large language model (LLM) inference scenarios on NVIDIA DGX Spark. It integrates system metrics, vLLM inference observability, and persistent token tracking, adopting a "history-first" philosophy to provide a long-term activity tracking experience similar to WakaTime, filling the gap of dedicated monitoring solutions in the DGX Spark ecosystem. The project is developed and maintained by Sahil Kapoor (@kapoorsahil), with code hosted on GitHub (link: https://github.com/kapoorsahil/spark-stack) and released in May 2025.