Improving Authorship Privacy: Adaptive Obfuscation with the Dynamic Selection of Techniques
Hemanth Kandula, Damianos Karakos, Haoling Qiu, Brian Ulicny
Abstract
Authorship obfuscation, the task of rewriting text to protect the original author’s identity, is becoming increasingly important due to the rise of advanced NLP tools for authorship attribution techniques. Traditional methods for authorship obfuscation face significant challenges in balancing content preservation, fluency, and style concealment. This paper introduces a novel approach, the Obfuscation Strategy Optimizer (OSO), which dynamically selects the optimal obfuscation technique based on a combination of metrics including embedding distance, meaning similarity, and fluency. By leveraging an ensemble of language models OSO achieves superior performance in preserving the original content’s meaning and grammatical fluency while effectively concealing the author’s unique writing style. Experimental results demonstrate that the OSO outperforms existing methods and approaches the performance of larger language models. Our evaluation framework incorporates adversarial testing against state-of-the-art attribution systems to validate the robustness of the obfuscation techniques. We release our code publicly at https://github.com/BBN-E/ObfuscationStrategyOptimizer- Anthology ID:
- 2024.privatenlp-1.14
- Volume:
- Proceedings of the Fifth Workshop on Privacy in Natural Language Processing
- Month:
- August
- Year:
- 2024
- Address:
- Bangkok, Thailand
- Editors:
- Ivan Habernal, Sepideh Ghanavati, Abhilasha Ravichander, Vijayanta Jain, Patricia Thaine, Timour Igamberdiev, Niloofar Mireshghallah, Oluwaseyi Feyisetan
- Venues:
- PrivateNLP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 137–142
- Language:
- URL:
- https://aclanthology.org/2024.privatenlp-1.14
- DOI:
- Cite (ACL):
- Hemanth Kandula, Damianos Karakos, Haoling Qiu, and Brian Ulicny. 2024. Improving Authorship Privacy: Adaptive Obfuscation with the Dynamic Selection of Techniques. In Proceedings of the Fifth Workshop on Privacy in Natural Language Processing, pages 137–142, Bangkok, Thailand. Association for Computational Linguistics.
- Cite (Informal):
- Improving Authorship Privacy: Adaptive Obfuscation with the Dynamic Selection of Techniques (Kandula et al., PrivateNLP-WS 2024)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-5/2024.privatenlp-1.14.pdf