DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models

Xueyu Zhou; Yangrong Hu; Jian Huang

DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models

Abstract

Masked diffusion language models (MDLMs) have recently emerged as a new paradigm in language modeling, offering flexible generation dynamics and enabling efficient parallel decoding. However, existing decoding strategies for pre-trained MDLMs predominantly rely on token-level uncertainty criteria, while largely overlooking sequence-level information and inter-token dependencies. To address this limitation, we propose Dependency-Oriented Sampler (DOS), a training-free decoding strategy that leverages inter-token dependencies to inform token updates during generation. Specifically, DOS exploits attention matrices from transformer blocks to approximate inter-token dependencies, emphasizing information from unmasked tokens when updating masked positions. Empirical results demonstrate that DOS consistently achieves superior performance on both code generation and mathematical reasoning tasks. Moreover, DOS can be seamlessly integrated with existing parallel sampling methods, leading to improved generation efficiency without sacrificing generation quality.

Anthology ID:: 2026.findings-acl.861
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 17404–17419
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.861/
DOI:
Bibkey:
Cite (ACL):: Xueyu Zhou, Yangrong Hu, and Jian Huang. 2026. DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 17404–17419, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models (Zhou et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.861.pdf
Checklist:: 2026.findings-acl.861.checklist.pdf

PDF Cite Search Checklist Fix data