Topic-Based Watermarks for Large Language Models

Alexander Nemecek; Yuzhou Jiang; Erman Ayday

Topic-Based Watermarks for Large Language Models

Alexander Nemecek, Yuzhou Jiang, Erman Ayday

Abstract

The indistinguishability of large language model (LLM) output from human-authored content poses significant challenges, raising concerns about potential misuse of AI-generated text and its influence on future model training. Watermarking algorithms offer a viable solution by embedding detectable signatures into generated text. However, existing watermarking methods often involve trade-offs among attack robustness, generation quality, and additional overhead such as specialized frameworks or complex integrations. We propose a lightweight, topic-guided watermarking scheme for LLMs that partitions the vocabulary into topic-aligned token subsets. Given an input prompt, the scheme selects a relevant topic-specific token list, effectively "green-listing" semantically aligned tokens to embed robust marks while preserving fluency and coherence. Experimental results across multiple LLMs and state-of-the-art benchmarks demonstrate that our method achieves text quality comparable to industry-leading systems and simultaneously improves watermark robustness against paraphrasing and lexical perturbation attacks, with minimal performance overhead. Our approach avoids reliance on additional mechanisms beyond standard text generation pipelines, enabling straightforward adoption and suggesting a practical path toward globally consistent watermarking of AI-generated content.

Anthology ID:: 2026.findings-acl.1220
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 24372–24402
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1220/
DOI:
Bibkey:
Cite (ACL):: Alexander Nemecek, Yuzhou Jiang, and Erman Ayday. 2026. Topic-Based Watermarks for Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 24372–24402, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Topic-Based Watermarks for Large Language Models (Nemecek et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1220.pdf
Checklist:: 2026.findings-acl.1220.checklist.pdf

PDF Cite Search Checklist Fix data