Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context

Hankyol Lee, Youngjae Yu, Gunhee Kim


Abstract
We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which utilizes conversational context to generate meaningful samples for training. We also mitigate the issues regarding unbalanced context lengths by changing the input output format of the model such that it can deal with varying context lengths effectively. Specifically, our proposed model, trained with the proposed data augmentation technique, participated in the sarcasm detection task of FigLang2020, have won and achieves the best performance in both Reddit and Twitter datasets.
Anthology ID:
2020.figlang-1.2
Volume:
Proceedings of the Second Workshop on Figurative Language Processing
Month:
July
Year:
2020
Address:
Online
Editors:
Beata Beigman Klebanov, Ekaterina Shutova, Patricia Lichtenstein, Smaranda Muresan, Chee Wee, Anna Feldman, Debanjan Ghosh
Venue:
Fig-Lang
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
12–17
Language:
URL:
https://aclanthology.org/2020.figlang-1.2
DOI:
10.18653/v1/2020.figlang-1.2
Bibkey:
Cite (ACL):
Hankyol Lee, Youngjae Yu, and Gunhee Kim. 2020. Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context. In Proceedings of the Second Workshop on Figurative Language Processing, pages 12–17, Online. Association for Computational Linguistics.
Cite (Informal):
Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context (Lee et al., Fig-Lang 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2020.figlang-1.2.pdf
Video:
 http://slideslive.com/38929696