Label distributions help implicit discourse relation classification

Frances Yung, Kaveri Anuranjana, Merel Scholman, Vera Demberg


Abstract
Implicit discourse relations can convey more than one relation sense, but much of the research on discourse relations has focused on single relation senses. Recently, DiscoGeM, a novel multi-domain corpus, which contains 10 crowd-sourced labels per relational instance, has become available. In this paper, we analyse the co-occurrences of relations in DiscoGem and show that they are systematic and characteristic of text genre. We then test whether information on multi-label distributions in the data can help implicit relation classifiers. Our results show that incorporating multiple labels in parser training can improve its performance, and yield label distributions which are more similar to human label distributions, compared to a parser that is trained on just a single most frequent label per instance.
Anthology ID:
2022.codi-1.7
Volume:
Proceedings of the 3rd Workshop on Computational Approaches to Discourse
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea and Online
Venue:
CODI
SIG:
Publisher:
International Conference on Computational Linguistics
Note:
Pages:
48–53
Language:
URL:
https://aclanthology.org/2022.codi-1.7
DOI:
Bibkey:
Cite (ACL):
Frances Yung, Kaveri Anuranjana, Merel Scholman, and Vera Demberg. 2022. Label distributions help implicit discourse relation classification. In Proceedings of the 3rd Workshop on Computational Approaches to Discourse, pages 48–53, Gyeongju, Republic of Korea and Online. International Conference on Computational Linguistics.
Cite (Informal):
Label distributions help implicit discourse relation classification (Yung et al., CODI 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/remove-xml-comments/2022.codi-1.7.pdf