Section 01
GenMLX: Open-Source Project for Apple Silicon Macs to Build Large Model Inference Clusters
GenMLX is an open-source project that connects multiple Apple Silicon Macs (M-series) via Thunderbolt 5 to form a tensor parallel inference cluster for running large parameter language models. Key features include Web UI management, OpenAI-compatible API, L2 disk cache, heterogeneous memory configuration, and deployment in 15 minutes. It addresses the memory bottleneck of single Macs for large models.