HTMuon: Improving Muon via Heavy-Tailed Spectral Correction

Tianyu Pang, Yujie Fang, Zihang Liu, Shenyang Deng, Lei Hsiung, Shuhua Yu, Yaoqing Yang


Abstract
has recently shown promising results in LLM training. In this work, we study how to further improve . We argue that ’s orthogonalized update rule suppresses the emergence of heavy-tailed weight spectra and over-emphasizes the training along noise-dominated directions. Motivated by the Heavy-Tailed Self-Regularization (HT-SR) theory, we propose . preserves ’s ability to capture parameter interdependencies while producing heavier-tailed updates and inducing heavier-tailed weight spectra. Experiments on LLM pretraining and image classification show that consistently improves performance over state-of-the-art baselines and can also serve as a plug-in on top of existing variants. For example, on LLaMA pretraining on the C4 dataset, reduces perplexity by up to 0.98 compared to . We further theoretically show that corresponds to steepest descent under the Schatten-q norm constraint and provide convergence analysis in smooth non-convex settings. The implementation of is available at https://github.com/TDCSZ327/HTmuon.
Anthology ID:
2026.findings-acl.1819
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
36504–36535
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1819/
DOI:
Bibkey:
Cite (ACL):
Tianyu Pang, Yujie Fang, Zihang Liu, Shenyang Deng, Lei Hsiung, Shuhua Yu, and Yaoqing Yang. 2026. HTMuon: Improving Muon via Heavy-Tailed Spectral Correction. In Findings of the Association for Computational Linguistics: ACL 2026, pages 36504–36535, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
HTMuon: Improving Muon via Heavy-Tailed Spectral Correction (Pang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1819.pdf
Checklist:
 2026.findings-acl.1819.checklist.pdf