Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly
Wenya Xie, Shaochen Zhong, Hoang Anh Duy Le, Zhaozhuo Xu, Jianwen Xie, Zirui Liu
Abstract
Large Reasoning Models (LRMs) are often bottlenecked by the high cost of output tokens. We show that a significant portion of these tokens are useless self-repetitions — what we call “word salad” — that exhaust the decoding budget without adding value. Interestingly, we observe that LRMs are self-aware when trapped in these loops: the hidden states of tokens trailing each reasoning chunk exhibit patterns that allow us to detect word salad behavior on-the-fly via a single linear classifier. Once detected, a simple chop followed by a straightforward regeneration prompt yields substantial length savings with minimal quality loss. Our work offers WordSaladChopper (WSC) — a lightweight, turnkey component for LRMs that is minimally invasive to the reasoning trajectory. Given its low overhead, strong savings, and the lack of semantic value of word salad tokens, we believe it is not too far-fetched to argue that WSC — or a similar component — is a must-have for all LRM applications with user experience in mind.
- Anthology ID:
- 2025.emnlp-main.1705
- Volume:
- Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
- Venue:
- EMNLP
- Publisher:
- Association for Computational Linguistics
- Pages:
- 33576–33586
- URL:
- https://preview.aclanthology.org/ingest-luhme/2025.emnlp-main.1705/
- DOI:
- 10.18653/v1/2025.emnlp-main.1705
- Cite (ACL):
- Wenya Xie, Shaochen Zhong, Hoang Anh Duy Le, Zhaozhuo Xu, Jianwen Xie, and Zirui Liu. 2025. Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 33576–33586, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly (Xie et al., EMNLP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-luhme/2025.emnlp-main.1705.pdf
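The abstract's detect-then-chop loop can be sketched in a few lines: a single linear probe over the hidden state trailing each reasoning chunk flags word-salad behavior, after which the output is truncated at the flagged chunk and a short regeneration prompt is appended. The probe weights, the 0.5 threshold, the chunk delimiter, and the reprompt text below are illustrative placeholders, not the paper's actual values or implementation.

```python
import numpy as np

def is_word_salad(hidden_state: np.ndarray, w: np.ndarray, b: float) -> bool:
    """Single linear classifier: flag a chunk as word salad when
    sigmoid(w . h + b) exceeds 0.5 (threshold is a placeholder)."""
    logit = float(hidden_state @ w + b)
    return 1.0 / (1.0 + np.exp(-logit)) > 0.5

def chop_and_reprompt(
    text: str,
    chunk_states: list,          # one trailing hidden state per chunk
    w: np.ndarray,
    b: float,
    # Hypothetical regeneration prompt; the paper's wording may differ.
    reprompt: str = "\n\nLet me stop repeating and give the answer directly.",
) -> str:
    """Scan chunk-trailing hidden states in order; on the first flagged
    chunk, chop the output there and append the regeneration prompt."""
    chunks = text.split("\n\n")  # assumed chunk delimiter
    for i, h in enumerate(chunk_states):
        if is_word_salad(np.asarray(h, dtype=float), w, b):
            return "\n\n".join(chunks[:i]) + reprompt
    return text  # no word salad detected; output unchanged
```

In a real serving stack the probe would run during decoding on the model's own hidden states (hence the "on-the-fly" and "low overhead" claims), rather than post hoc on a finished string as in this toy version.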