章节 01
llm-pool: FastAPI-based LLM Inference Pooling Service Overview
Core Introduction llm-pool is a FastAPI-built LLM inference pooling service supporting mixed deployment of local models and OpenAI-compatible remote APIs. It offers scheduling management, replica control, metrics monitoring, and admin API functions, ideal for enterprise scenarios requiring unified management of multiple LLM backends.
Source Info
- Maintainer: Bobcat
- Platform: GitHub
- Release Time: 2026-06-09
- Repository Link: https://github.com/Bobcat/llm-pool