Section 01
[Original Post] BitStateLM: Core Guide to a Matmul-Free Large Model Engine That Runs in Under 1GB of Memory
BitStateLM is a large model inference engine built for edge devices, developed by puzzlesnotpeople. It combines the RWKV linear attention mechanism with BitNet 1.58-bit quantization, so inference needs only 8.7MB of storage and less than 1GB of running memory. It ships as a dependency-free C++ engine and can also be deployed in the browser through WebAssembly, offering an AI inference option for resource-constrained environments.
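To make the "no matrix multiplication" idea concrete, here is a minimal sketch of how 1.58-bit (ternary) weights let a matrix-vector product run on additions and subtractions alone. It is written in plain Python for readability and is not BitStateLM's actual code: the function names `absmean_ternarize` and `ternary_matvec` are hypothetical, and the absmean scaling is assumed from the general BitNet b1.58 recipe rather than taken from this project.

```python
from typing import List, Tuple


def absmean_ternarize(weights: List[List[float]]) -> Tuple[List[List[int]], float]:
    """Quantize a weight matrix to {-1, 0, +1} with one shared absmean scale.

    Hypothetical helper: illustrates BitNet-style ternary quantization,
    not the exact scheme used by BitStateLM.
    """
    flat = [abs(w) for row in weights for w in row]
    # Mean absolute value; fall back to 1.0 if the matrix is all zeros.
    scale = sum(flat) / len(flat) or 1.0
    # Round to the nearest integer, then clip into the ternary range.
    ternary = [[max(-1, min(1, round(w / scale))) for w in row] for row in weights]
    return ternary, scale


def ternary_matvec(ternary: List[List[int]], scale: float, x: List[float]) -> List[float]:
    """Compute (scale * ternary) @ x with no multiplications in the inner loop."""
    out = []
    for row in ternary:
        acc = 0.0
        for t, xi in zip(row, x):
            if t == 1:
                acc += xi      # weight +1: add the activation
            elif t == -1:
                acc -= xi      # weight -1: subtract the activation
            # weight 0: skip the activation entirely
        out.append(acc * scale)  # one rescale per output row
    return out


if __name__ == "__main__":
    W = [[0.9, -0.1, -1.2],
         [0.05, 0.6, 0.0]]
    x = [2.0, 3.0, 5.0]
    ternary, scale = absmean_ternarize(W)
    print(ternary, scale)
    print(ternary_matvec(ternary, scale, x))
```

The only multiplication left is a single rescale per output row; the inner loop touches each activation with an add, a subtract, or nothing. A real engine would pack the ternary values into a few bits each instead of storing Python integers, which is where the small storage footprint comes from.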