Leveraging Semantic Triples for Private Document Generation with Local Differential Privacy Guarantees

Stephen Meisenbacher, Maulik Chevli, Florian Matthes


Abstract
Many works at the intersection of Differential Privacy (DP) in Natural Language Processing aim to protect privacy by transforming texts under DP guarantees. This can be performed in a variety of ways, from word perturbations to full document rewriting, and most often under *local* DP. Here, an input text must be made indistinguishable from any other potential text, within some bound governed by the privacy parameter 𝜀. Such a guarantee is quite demanding, and recent works show that privatizing texts under local DP can only be done reasonably under very high 𝜀 values. Addressing this challenge, we introduce **DP-ST**, which leverages semantic triples for neighborhood-aware private document generation under local DP guarantees. Through the evaluation of our method, we demonstrate the effectiveness of the *divide-and-conquer* paradigm, particularly when limiting the DP notion (and privacy guarantees) to that of a *privatization neighborhood*. When combined with LLM post-processing, our method allows for coherent text generation even at lower 𝜀 values, while still balancing privacy and utility. These findings highlight the importance of coherence in achieving balanced privatization outputs at reasonable 𝜀 levels.
Anthology ID:
2025.emnlp-main.455
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8987–9003
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.455/
DOI:
Bibkey:
Cite (ACL):
Stephen Meisenbacher, Maulik Chevli, and Florian Matthes. 2025. Leveraging Semantic Triples for Private Document Generation with Local Differential Privacy Guarantees. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 8987–9003, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Leveraging Semantic Triples for Private Document Generation with Local Differential Privacy Guarantees (Meisenbacher et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.455.pdf
Checklist:
 2025.emnlp-main.455.checklist.pdf