Section 01
Introduction: GPU Forecasters—Accelerating GPU Kernel Optimization with LLM
Introduction: GPU Forecasters—Accelerating GPU Kernel Optimization with LLM
This article introduces the GPU Forecasters method published on arXiv on May 29, 2026. Its core is to use large language models (LLM) as selective proxies to predict the relative performance of GPU kernels, reducing reliance on actual hardware measurements. Under the same GPU budget, this method can evaluate several times more candidate solutions, significantly improving kernel search efficiency and aiding deep learning system optimization.