Section 01
Cider Project Overview: An MLX Extension Unlocking INT8 Inference on Apple Silicon
Cider is an MLX extension project for Apple Silicon chips. It unlocks underutilized INT8 tensor operation capabilities through custom primitives, enabling W8A8/W4A8 quantized inference. This significantly boosts the prefill speed of large language models (1.2-1.9x) and fully leverages the hardware potential of Apple Silicon.