Task-driven augmented data evaluation

Olga Golovneva, Pan Wei, Khadige Abboud, Charith Peris, Lizhen Tan, Haiyang Yu


Abstract
In the area of data augmentation research, the main focus to date has been on the improvement of the generation models, while the examination and improvements to synthetic data evaluation methods remains less explored. In our work, we explore a number of sentence similarity measures in the context of data generation filtering, and evaluate their impact on the performance of the targeted Natural Language Understanding problem on the example of the intent classification and named entity recognition tasks. Our experiments on ATIS dataset show that the right choice of filtering technique can bring up to 33% in sentence accuracy improvement for targeted underrepresented intents.
Anthology ID:
2022.gem-1.2
Volume:
Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates (Hybrid)
Editors:
Antoine Bosselut, Khyathi Chandu, Kaustubh Dhole, Varun Gangal, Sebastian Gehrmann, Yacine Jernite, Jekaterina Novikova, Laura Perez-Beltrachini
Venue:
GEM
SIG:
SIGGEN
Publisher:
Association for Computational Linguistics
Note:
Pages:
18–25
Language:
URL:
https://aclanthology.org/2022.gem-1.2
DOI:
10.18653/v1/2022.gem-1.2
Bibkey:
Cite (ACL):
Olga Golovneva, Pan Wei, Khadige Abboud, Charith Peris, Lizhen Tan, and Haiyang Yu. 2022. Task-driven augmented data evaluation. In Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM), pages 18–25, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Task-driven augmented data evaluation (Golovneva et al., GEM 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-1/2022.gem-1.2.pdf