Section 01
【Introduction】TIBET-Store MMU: Transparent Memory Virtualization with 7-Microsecond Page-Fault Latency, Software-Defined NVLink for LLM Inference
TIBET-Store MMU is an open-source project built on the Linux userfaultfd mechanism, which lets a user-space process handle page faults for registered memory regions. The project achieves transparent memory virtualization with a page-fault latency of 7 microseconds. Using an approach it calls "MMU illusion", it offers a software-defined memory expansion solution for large language model inference, with support for encrypted and compressed storage and for on-demand loading. It is an exploratory project in AI infrastructure.
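
To make the idea of on-demand loading from compressed storage concrete, here is a minimal Python sketch of the fault-handling logic. It is only a model: it does not call userfaultfd, it omits encryption, and it is not taken from the TIBET-Store MMU code. The names `CompressedPageStore`, `LazyRegion`, and `PAGE_SIZE` are hypothetical and chosen for illustration.

```python
import zlib

PAGE_SIZE = 4096  # assumed page size for this sketch


class CompressedPageStore:
    """Hypothetical backing store: page number -> zlib-compressed page."""

    def __init__(self):
        self._blobs = {}

    def put(self, page_no, data):
        if len(data) != PAGE_SIZE:
            raise ValueError("page must be exactly PAGE_SIZE bytes")
        self._blobs[page_no] = zlib.compress(data)

    def get(self, page_no):
        return zlib.decompress(self._blobs[page_no])


class LazyRegion:
    """Models a memory region whose pages are filled in on first access."""

    def __init__(self, store):
        self._store = store
        self._resident = {}   # pages that have already been loaded
        self.fault_count = 0  # number of simulated page faults

    def read(self, addr, length):
        out = bytearray()
        while length > 0:
            page_no, offset = divmod(addr, PAGE_SIZE)
            if page_no not in self._resident:
                # Simulated page fault: fetch and decompress the page,
                # then keep it resident for later accesses.
                self.fault_count += 1
                self._resident[page_no] = self._store.get(page_no)
            chunk = self._resident[page_no][offset:offset + length]
            out += chunk
            addr += len(chunk)
            length -= len(chunk)
        return bytes(out)


store = CompressedPageStore()
store.put(0, b"a" * PAGE_SIZE)
store.put(1, b"b" * PAGE_SIZE)
region = LazyRegion(store)
data = region.read(PAGE_SIZE - 2, 4)  # spans the boundary of pages 0 and 1
```

In this sketch a page is decompressed only the first time it is touched, and later reads of the same page are served from the resident copy. In a real userfaultfd-based system the kernel would deliver the fault to a user-space handler, which would supply the page contents in a similar way.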