Section 01
[Introduction] Core Overview of the Practical Guide to Local Large Language Models
This personal note records how to fully run, fine-tune, and deploy large language models in a local environment, covering mainstream tools like llama.cpp, Ollama, MLX, and advanced topics such as RAG, model merging, and safety guardrails. It provides developers with a reusable practical path, especially verifying inference performance for Apple Silicon platform users. The core value lies in giving users full control over models and avoiding dependency on commercial APIs.