A Linguistics-Aware LLM Watermarking via Syntactic Predictability

Shinwoo Park; Hyejin Park; Hyeseon An; Yo-Sub Han

A Linguistics-Aware LLM Watermarking via Syntactic Predictability

Shinwoo Park, Hyejin Park, Hyeseon An, Yo-Sub Han

Abstract

As large language models (LLMs) continue to advance rapidly, reliable governance tools have become critical. Publicly verifiable watermarking is particularly essential for fostering a trustworthy AI ecosystem. A central challenge persists: balancing text quality against detection robustness. Recent studies have sought to navigate this trade-off by leveraging signals from model output distributions (e.g., token-level entropy); however, their reliance on these model-specific signals presents a significant barrier to public verification, as the detection process requires access to the logits of the underlying model. We introduce STELA, a novel framework that aligns watermark strength with the linguistic degrees of freedom inherent in language. STELA dynamically modulates the signal using part-of-speech (POS) n-gram–modeled linguistic indeterminacy, weakening it in grammatically constrained contexts to preserve quality and strengthening it in contexts with greater linguistic flexibility to enhance detectability. Our detector operates without access to any model logits, thus facilitating publicly verifiable detection. Through extensive experiments on typologically diverse languages—analytic English, isolating Chinese, and agglutinative Korean—we show that STELA surpasses prior methods in detection robustness. Our code is available at https://github.com/Shinwoo-Park/stela_watermark.

Anthology ID:: 2026.acl-long.2115
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 45629–45647
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.2115/
DOI:
Bibkey:
Cite (ACL):: Shinwoo Park, Hyejin Park, Hyeseon An, and Yo-Sub Han. 2026. A Linguistics-Aware LLM Watermarking via Syntactic Predictability. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 45629–45647, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: A Linguistics-Aware LLM Watermarking via Syntactic Predictability (Park et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.2115.pdf
Checklist:: 2026.acl-long.2115.checklist.pdf

PDF Cite Search Checklist Fix data