Section 01
Janus: A High-Performance Modular LLM Inference Engine Built with Rust (Introduction)
Janus is a high-performance large language model (LLM) inference engine developed using Rust. It features a modular architecture, supports deterministic routing between local and cloud models, provides a dynamic native plugin system, and is optimized for Agentic and role-playing workflows. Its core goal is to address the pain points of existing inference frameworks in terms of performance, modularity, and scalability.