Label distributions help implicit discourse relation classification
Frances Yung, Kaveri Anuranjana, Merel Scholman, Vera Demberg
Abstract
Implicit discourse relations can convey more than one relation sense, but much of the research on discourse relations has focused on single relation senses. Recently, DiscoGeM, a novel multi-domain corpus, which contains 10 crowd-sourced labels per relational instance, has become available. In this paper, we analyse the co-occurrences of relations in DiscoGem and show that they are systematic and characteristic of text genre. We then test whether information on multi-label distributions in the data can help implicit relation classifiers. Our results show that incorporating multiple labels in parser training can improve its performance, and yield label distributions which are more similar to human label distributions, compared to a parser that is trained on just a single most frequent label per instance.- Anthology ID:
- 2022.codi-1.7
- Volume:
- Proceedings of the 3rd Workshop on Computational Approaches to Discourse
- Month:
- October
- Year:
- 2022
- Address:
- Gyeongju, Republic of Korea and Online
- Venue:
- CODI
- SIG:
- Publisher:
- International Conference on Computational Linguistics
- Note:
- Pages:
- 48–53
- Language:
- URL:
- https://aclanthology.org/2022.codi-1.7
- DOI:
- Cite (ACL):
- Frances Yung, Kaveri Anuranjana, Merel Scholman, and Vera Demberg. 2022. Label distributions help implicit discourse relation classification. In Proceedings of the 3rd Workshop on Computational Approaches to Discourse, pages 48–53, Gyeongju, Republic of Korea and Online. International Conference on Computational Linguistics.
- Cite (Informal):
- Label distributions help implicit discourse relation classification (Yung et al., CODI 2022)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2022.codi-1.7.pdf