Section 01
Edge-LM Project Guide: Running Compressed Large Language Models Locally on Apple Devices
Edge-LM is an open-source project based on the Apple MLX framework, focusing on running compressed large language models (LLMs) locally on iOS devices and Apple Silicon Macs. It achieves efficient inference on edge devices using Gemma checkpoints with a 7x size reduction. Its core values include fully offline operation to protect privacy, low-latency real-time interaction, no API fees, and no network dependency.