Section 01
Introduction to The Spectral Filtering Nature of Momentum in the Muon Optimizer: Denoise First, Orthogonalize Later
Core Insights
The study reveals the theoretical role of momentum in the Muon optimizer: momentum acts as a spectral filter, which suppresses disturbances and preserves dominant signals under the structured signal plus perturbed gradient model, amplifies the spectral gap to stabilize the singular subspace of the orthogonalization step; additionally, the order of 'calculating momentum first, then orthogonalizing' is critical, and the theory has been experimentally verified.
Original Article Information
- Original Authors: arXiv Author Team
- Source: arXiv (published on June 2, 2026)
- Original Title: Denoise First, Orthogonalize Later: Understanding Momentum in Muon via Spectral Filtering
- Original Link: http://arxiv.org/abs/2606.03899v1