Improving Authorship Privacy: Adaptive Obfuscation with the Dynamic Selection of Techniques

Hemanth Kandula, Damianos Karakos, Haoling Qiu, Brian Ulicny


Abstract
Authorship obfuscation, the task of rewriting text to protect the original author’s identity, is becoming increasingly important due to the rise of advanced NLP tools for authorship attribution techniques. Traditional methods for authorship obfuscation face significant challenges in balancing content preservation, fluency, and style concealment. This paper introduces a novel approach, the Obfuscation Strategy Optimizer (OSO), which dynamically selects the optimal obfuscation technique based on a combination of metrics including embedding distance, meaning similarity, and fluency. By leveraging an ensemble of language models OSO achieves superior performance in preserving the original content’s meaning and grammatical fluency while effectively concealing the author’s unique writing style. Experimental results demonstrate that the OSO outperforms existing methods and approaches the performance of larger language models. Our evaluation framework incorporates adversarial testing against state-of-the-art attribution systems to validate the robustness of the obfuscation techniques. We release our code publicly at https://github.com/BBN-E/ObfuscationStrategyOptimizer
Anthology ID:
2024.privatenlp-1.14
Volume:
Proceedings of the Fifth Workshop on Privacy in Natural Language Processing
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Ivan Habernal, Sepideh Ghanavati, Abhilasha Ravichander, Vijayanta Jain, Patricia Thaine, Timour Igamberdiev, Niloofar Mireshghallah, Oluwaseyi Feyisetan
Venues:
PrivateNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
137–142
Language:
URL:
https://aclanthology.org/2024.privatenlp-1.14
DOI:
Bibkey:
Cite (ACL):
Hemanth Kandula, Damianos Karakos, Haoling Qiu, and Brian Ulicny. 2024. Improving Authorship Privacy: Adaptive Obfuscation with the Dynamic Selection of Techniques. In Proceedings of the Fifth Workshop on Privacy in Natural Language Processing, pages 137–142, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Improving Authorship Privacy: Adaptive Obfuscation with the Dynamic Selection of Techniques (Kandula et al., PrivateNLP-WS 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/dois-2013-emnlp/2024.privatenlp-1.14.pdf