Section 01
[Introduction] Chiplet-Contiguous Layout: A New Scheme for Optimizing Multi-Chiplet GPU Memory Layout
Core Point: This article proposes the Chiplet-Contiguous Layout technology, which solves the incompatibility between locality-aware data placement and fixed page-granularity data interleaving in multi-chiplet GPUs by storing chiplet-local data contiguously. It achieves significant reduction in remote HBM traffic for GEMM workloads of Qwen 3 30B and Llama 3.1 70B models.
Original Author and Source:
- Original Author/Maintainer: arXiv authors
- Source Platform: arXiv
- Original Title: Making Locality-aware GEMM Compatible with Page-Granularity Placement on Chiplet GPUs
- Original Link: http://arxiv.org/abs/2606.11718v1
- Source Publication/Update Time: 2026-06-10T06:47:27Z