The long and the short of it: DRASTIC, a semantically annotated dataset containing sentences of more natural length

Dag Haug, Jamie Yates Findlay, Ahmet Yildirim


Abstract
This paper presents a new dataset with Discourse Representation Structures (DRSs) annotated over naturally-occurring sentences. Importantly, these sentences are more varied in length and on average longer than those in the existing gold-standard DRS dataset, the Parallel Meaning Bank, and we show that they are therefore much harder for parsers. We argue, though, that this provides a more realistic assessment of the difficulties of DRS parsing.
Anthology ID:
2023.dmr-1.9
Volume:
Proceedings of the Fourth International Workshop on Designing Meaning Representations
Month:
June
Year:
2023
Address:
Nancy, France
Editors:
Julia Bonn, Nianwen Xue
Venues:
DMR | WS
SIG:
SIGSEM
Publisher:
Association for Computational Linguistics
Note:
Pages:
89–98
Language:
URL:
https://aclanthology.org/2023.dmr-1.9
DOI:
Bibkey:
Cite (ACL):
Dag Haug, Jamie Yates Findlay, and Ahmet Yildirim. 2023. The long and the short of it: DRASTIC, a semantically annotated dataset containing sentences of more natural length. In Proceedings of the Fourth International Workshop on Designing Meaning Representations, pages 89–98, Nancy, France. Association for Computational Linguistics.
Cite (Informal):
The long and the short of it: DRASTIC, a semantically annotated dataset containing sentences of more natural length (Haug et al., DMR-WS 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-3/2023.dmr-1.9.pdf