@inproceedings{dat-etal-2025-discrete,
title = "Discrete Diffusion Language Model for Efficient Text Summarization",
author = "Dat, Do Huu and
Do, Duc Anh and
Luu, Anh Tuan and
Buntine, Wray",
editor = "Chiruzzo, Luis and
Ritter, Alan and
Wang, Lu",
booktitle = "Findings of the Association for Computational Linguistics: NAACL 2025",
month = apr,
year = "2025",
address = "Albuquerque, New Mexico",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/fix-sig-urls/2025.findings-naacl.352/",
pages = "6278--6290",
ISBN = "979-8-89176-195-7",
abstract = "While diffusion models excel at conditionally generating high-quality images, prior works in discrete diffusion models were not evaluated on conditional long-text generation. This work addresses the limitations of prior discrete diffusion models for conditional long-text generation, particularly in the long abstractive summarization task. Despite faster decoding speeds compared to autoregressive methods, previous discrete diffusion models failed on the abstractive summarization task due to the incompatibility between the backbone architectures and the random noising process. To overcome these challenges, we introduce a novel semantic-aware noising process that enables Transformer backbones to handle long sequences effectively. Additionally, we propose CrossMamba, an adaptation of the Mamba model to the encoder-decoder paradigm, which integrates seamlessly with the random absorbing noising process. Our approaches outperform existing discrete diffusion models on three benchmark summarization datasets: Gigaword, CNN/DailyMail, and Arxiv, while also achieving much faster inference speed compared to autoregressive models."
}
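
The abstract builds on the random absorbing noising process common in discrete diffusion language models (D3PM-style), where each token is independently replaced by an absorbing [MASK] token with a probability that grows with the timestep. Below is a minimal PyTorch sketch of that generic forward process, assuming a linear keep-probability schedule and a hypothetical `MASK_ID`; it illustrates the standard absorbing process only, not the paper's semantic-aware variant or CrossMamba.

```python
# Generic D3PM-style absorbing forward process (illustrative sketch only).
# MASK_ID and the linear schedule are assumptions, not the paper's method.
import torch

MASK_ID = 0  # hypothetical id of the absorbing [MASK] token


def absorbing_noise(x0: torch.Tensor, t: torch.Tensor, num_steps: int) -> torch.Tensor:
    """Corrupt token ids x0 (batch, seq_len) at integer timesteps t (batch,).

    Each token is independently replaced by MASK_ID with probability
    1 - alpha_t, where alpha_t decays linearly from 1 (t = 0) to 0 (t = T).
    """
    alpha_t = 1.0 - t.float() / num_steps                       # per-example keep probability
    keep = torch.rand_like(x0, dtype=torch.float) < alpha_t.unsqueeze(-1)
    return torch.where(keep, x0, torch.full_like(x0, MASK_ID))  # masked copy of x0


# Example: corrupt two 5-token sequences, one early and one late in the schedule.
x0 = torch.randint(1, 100, (2, 5))
xt = absorbing_noise(x0, t=torch.tensor([10, 90]), num_steps=100)
```

At small t most tokens survive, while at large t the sequence is almost entirely [MASK]; the reverse (denoising) model is trained to recover x0 from such corrupted inputs.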