Section 01
【Introduction】PocketLLM: Meta-Network Driven Extreme Compression of Large Models, A New Breakthrough in Edge Deployment
PocketLLM is a large model compression method based on meta-networks proposed by authors such as Ye Tian and Chengcheng Wang. By projecting LLM weights into a discrete latent space using an encoder-codebook-decoder architecture, it achieves nearly lossless performance at a 10x compression ratio. This work has been accepted by AAAI 2026, and the project is open-sourced on GitHub, providing a feasible solution for deploying large models on edge devices. The original sources are GitHub/arXiv, paper link: https://arxiv.org/abs/2511.17637, published in November 2025 (arXiv submission).