Section 01
fast-slow-llm: Introduction to the Dual-System Intelligent Routing Gateway
fast-slow-llm is an LLM gateway system inspired by the dual-system theory from Daniel Kahneman's Thinking, Fast and Slow. It dynamically routes queries via intelligent routing to either the fast and low-cost System 1 model or the deep-reasoning System 2 model, achieving up to 99% API cost savings while maintaining response quality.