Section 01
FerrisRes: A Next-Generation LLM Inference Engine Built Entirely in Rust (Introduction)
FerrisRes is an LLM inference and training engine written entirely in Rust. Its core innovation is the Block AttnRes architecture, which is described as having linear time complexity. The engine supports cross-platform GPU acceleration through wgpu (Vulkan, Metal, DX12, and WebGPU backends) and has no Python dependencies. It aims to avoid problems associated with the Python ecosystem, such as GIL limitations, the risks of dynamic typing, and complex dependency chains, which makes it suitable for edge devices, cross-platform deployment, and resource-constrained environments.