@inproceedings{harel-etal-2024-protecting,
    title = "Protecting Privacy in Classifiers by Token Manipulation",
    author = "Harel, Re{'}em  and
      Elboher, Yair  and
      Pinter, Yuval",
    editor = "Habernal, Ivan  and
      Ghanavati, Sepideh  and
      Ravichander, Abhilasha  and
      Jain, Vijayanta  and
      Thaine, Patricia  and
      Igamberdiev, Timour  and
      Mireshghallah, Niloofar  and
      Feyisetan, Oluwaseyi",
    booktitle = "Proceedings of the Fifth Workshop on Privacy in Natural Language Processing",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2024.privatenlp-1.4/",
    pages = "29--38",
    abstract = "Using language models as a remote service entails sending private information to an untrusted provider. In addition, potential eavesdroppers can intercept the messages, thereby exposing the information. In this work, we explore the prospects of avoiding such data exposure at the level of text manipulation. We focus on text classification models, examining various token mapping and contextualized manipulation functions in order to see whether classifier accuracy may be maintained while keeping the original text unrecoverable. We find that although some token mapping functions are easy and straightforward to implement, they heavily influence performance on the downstream task, and via a sophisticated attacker can be reconstructed. In comparison, the contextualized manipulation provides an improvement in performance."
}Markdown (Informal)
[Protecting Privacy in Classifiers by Token Manipulation](https://preview.aclanthology.org/ingest-emnlp/2024.privatenlp-1.4/) (Harel et al., PrivateNLP 2024)
ACL