Section 01
Introduction to the llama-sandbox Project: Focus on LLM Inference Experiments with llama.cpp and MLX
llama-sandbox is a collection of experiments built on the llama.cpp and Apple MLX frameworks, exploring efficient inference techniques for large language models (LLMs) on Apple Silicon and other platforms. It is organized as a sandbox in which each subdirectory is an independent experiment. This makes it a useful learning resource for developers who want to understand the underlying mechanisms of LLM inference and explore optimizations for edge computing.