TalkDown: A Corpus for Condescension Detection in Context

Zijian Wang, Christopher Potts


Abstract
Condescending language use is caustic; it can bring dialogues to an end and bifurcate communities. Thus, systems for condescension detection could have a large positive impact. A challenge here is that condescension is often impossible to detect from isolated utterances, as it depends on the discourse and social context. To address this, we present TalkDown, a new labeled dataset of condescending linguistic acts in context. We show that extending a language-only model with representations of the discourse improves performance, and we motivate techniques for dealing with the low rates of condescension overall. We also use our model to estimate condescension rates in various online communities and relate these differences to differing community norms.
Anthology ID:
D19-1385
Volume:
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
Month:
November
Year:
2019
Address:
Hong Kong, China
Venues:
EMNLP | IJCNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Note:
Pages:
3711–3719
Language:
URL:
https://aclanthology.org/D19-1385
DOI:
10.18653/v1/D19-1385
Bibkey:
Cite (ACL):
Zijian Wang and Christopher Potts. 2019. TalkDown: A Corpus for Condescension Detection in Context. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3711–3719, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
TalkDown: A Corpus for Condescension Detection in Context (Wang & Potts, EMNLP-IJCNLP 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/D19-1385.pdf
Code
 zijwang/talkdown
Data
TalkDown