Section 01
NNCF: OpenVINO Neural Network Compression Framework Deep Dive
NNCF is an open-source neural network compression framework by Intel, supporting PyTorch, ONNX, and OpenVINO models. It provides post-training quantization, quantization-aware training, pruning, weight compression, etc., to boost inference performance with minimal accuracy loss. This thread breaks down its background, core algorithms, architecture, usage, and ecosystem integration.