Section 01
Introduction / Original Post: Air.rs: Running 70B+ Large Model Inference on Consumer GPUs with Rust
Air.rs is an LLM inference engine written in Rust and built around S.L.I.P. (Slipstream Layer Inference Protocol). By combining memory mapping with layer streaming, it can run large language models with more than 70B parameters on consumer GPUs that have only 24GB of VRAM.
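
To make the memory argument concrete, here is a rough back-of-the-envelope sketch in Rust. It is not taken from Air.rs. The function names are hypothetical, and the 80-layer count, the even split of weights across layers, and the choice of keeping two layers resident at once (one computing, one loading) are all assumptions for illustration. It also ignores embeddings, activations, and the KV cache.

```rust
/// Bytes needed to store `params` weights at `bits_per_weight` precision.
fn weight_bytes(params: u64, bits_per_weight: u64) -> u64 {
    params * bits_per_weight / 8
}

/// Peak weight residency when only `resident_layers` layers sit in VRAM at a time.
/// Assumption: weights are spread evenly over `num_layers` (rounded up per layer).
fn streamed_peak_bytes(total_bytes: u64, num_layers: u64, resident_layers: u64) -> u64 {
    let per_layer = (total_bytes + num_layers - 1) / num_layers;
    per_layer * resident_layers
}

/// True if `bytes` fits within a VRAM budget of `vram_bytes`.
fn fits(bytes: u64, vram_bytes: u64) -> bool {
    bytes <= vram_bytes
}
```

Under these assumptions, a 70B-parameter model needs 140 GB for 16-bit weights and 35 GB even at 4 bits per weight, so neither fits in 24GB of VRAM when loaded whole. Streaming with two of 80 layers resident brings the 16-bit weight footprint down to about 3.5 GB, which is the gap that layer streaming is meant to close. The cost is that every layer has to be fetched again for each forward pass, so throughput becomes bound by storage and bus bandwidth.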