Section 01
Introduction: Hippo — A One-Stop Local LLM Inference and RAG Solution for Consumer Hardware
Hippo is a Python toolkit that integrates local LLM inference and document retrieval. It supports pipeline parallelism to split models across multiple devices, has built-in BM25 + semantic hybrid search, requires no additional vector databases, can be installed via pip install hippo-llm, and enables running 30B models on consumer hardware.