Private Text Classification with Convolutional Neural Networks

Samuel Adams, David Melanson, Martine De Cock


Abstract
Text classifiers are regularly applied to personal texts, leaving users of these classifiers vulnerable to privacy breaches. We propose a solution for privacy-preserving text classification that is based on Convolutional Neural Networks (CNNs) and Secure Multiparty Computation (MPC). Our method enables the inference of a class label for a personal text in such a way that (1) the owner of the personal text does not have to disclose their text to anyone in an unencrypted manner, and (2) the owner of the text classifier does not have to reveal the trained model parameters to the text owner or to anyone else. To demonstrate the feasibility of our protocol for practical private text classification, we implemented it in the PyTorch-based MPC framework CrypTen, using a well-known additive secret sharing scheme in the honest-but-curious setting. We test the runtime of our privacy-preserving text classifier, which is fast enough to be used in practice.
Anthology ID:
2021.privatenlp-1.7
Volume:
Proceedings of the Third Workshop on Privacy in Natural Language Processing
Month:
June
Year:
2021
Address:
Online
Editors:
Oluwaseyi Feyisetan, Sepideh Ghanavati, Shervin Malmasi, Patricia Thaine
Venue:
PrivateNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
53–58
Language:
URL:
https://aclanthology.org/2021.privatenlp-1.7
DOI:
10.18653/v1/2021.privatenlp-1.7
Bibkey:
Cite (ACL):
Samuel Adams, David Melanson, and Martine De Cock. 2021. Private Text Classification with Convolutional Neural Networks. In Proceedings of the Third Workshop on Privacy in Natural Language Processing, pages 53–58, Online. Association for Computational Linguistics.
Cite (Informal):
Private Text Classification with Convolutional Neural Networks (Adams et al., PrivateNLP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/improve-issue-templates/2021.privatenlp-1.7.pdf