Semi-Supervised Formality Style Transfer with Consistency Training

Ao Liu, An Wang, Naoaki Okazaki


Abstract
Formality style transfer (FST) is a task that involves paraphrasing an informal sentence into a formal one without altering its meaning. To address the data-scarcity problem of existing parallel datasets, previous studies tend to adopt a cycle-reconstruction scheme to utilize additional unlabeled data, where the FST model mainly benefits from target-side unlabeled sentences. In this work, we propose a simple yet effective semi-supervised framework to better utilize source-side unlabeled sentences based on consistency training. Specifically, our approach augments pseudo-parallel data obtained from a source-side informal sentence by enforcing the model to generate similar outputs for its perturbed version. Moreover, we empirically examined the effects of various data perturbation methods and propose effective data filtering strategies to improve our framework. Experimental results on the GYAFC benchmark demonstrate that our approach can achieve state-of-the-art results, even with less than 40% of the parallel data.
Anthology ID:
2022.acl-long.321
Volume:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4689–4701
Language:
URL:
https://aclanthology.org/2022.acl-long.321
DOI:
10.18653/v1/2022.acl-long.321
Bibkey:
Cite (ACL):
Ao Liu, An Wang, and Naoaki Okazaki. 2022. Semi-Supervised Formality Style Transfer with Consistency Training. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4689–4701, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Semi-Supervised Formality Style Transfer with Consistency Training (Liu et al., ACL 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.acl-long.321.pdf
Code
 aolius/semi-fst
Data
GYAFC