TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods

Gabriel Loiseau, Damien Sileo, Damien Riquet, Maxime Meyer, Marc Tommasi


Abstract
Authorship obfuscation aims to disguise the identity of an author within a text by altering the writing style, vocabulary, syntax, and other linguistic features associated with the text author. This alteration needs to balance privacy and utility. While strong obfuscation techniques can effectively hide the author’s identity, they often degrade the quality and usefulness of the text for its intended purpose. Conversely, maintaining high utility tends to provide insufficient privacy, making it easier for an adversary to de-anonymize the author. Thus, achieving an optimal trade-off between these two conflicting objectives is crucial. In this paper, we propose **TAROT**: **T**ask-Oriented **A**utho**r**ship **O**bfuscation Using Policy Op**t**imization, a new unsupervised authorship obfuscation method whose goal is to optimize the privacy-utility trade-off by regenerating the entire text considering its downstream utility. Our approach leverages policy optimization as a fine-tuning paradigm over small language models in order to rewrite texts by preserving author identity and downstream task utility. We show that our approach largely reduces the accuracy of attackers while preserving utility. We make our code and models publicly available.
Anthology ID:
2025.privatenlp-main.2
Volume:
Proceedings of the Sixth Workshop on Privacy in Natural Language Processing
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Ivan Habernal, Sepideh Ghanavati, Vijayanta Jain, Timour Igamberdiev, Shomir Wilson
Venues:
PrivateNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
14–31
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.privatenlp-main.2/
DOI:
Bibkey:
Cite (ACL):
Gabriel Loiseau, Damien Sileo, Damien Riquet, Maxime Meyer, and Marc Tommasi. 2025. TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods. In Proceedings of the Sixth Workshop on Privacy in Natural Language Processing, pages 14–31, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods (Loiseau et al., PrivateNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.privatenlp-main.2.pdf