Large-scale Cross-lingual Language Resources for Referencing and Framing

Piek Vossen, Filip Ilievski, Marten Postma, Antske Fokkens, Gosse Minnema, Levi Remijnse


Abstract
In this article, we lay out the basic ideas and principles of the project Framing Situations in the Dutch Language. We provide our first results of data acquisition, together with the first data release. We introduce the notion of cross-lingual referential corpora. These corpora consist of texts that make reference to exactly the same incidents. The referential grounding allows us to analyze the framing of these incidents in different languages and across different texts. During the project, we will use the automatically generated data to study linguistic framing as a phenomenon, build framing resources such as lexicons and corpora. We expect to capture larger variation in framing compared to traditional approaches for building such resources. Our first data release, which contains structured data about a large number of incidents and reference texts, can be found at http://dutchframenet.nl/data-releases/.
Anthology ID:
2020.lrec-1.387
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
3162–3171
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.387
DOI:
Bibkey:
Cite (ACL):
Piek Vossen, Filip Ilievski, Marten Postma, Antske Fokkens, Gosse Minnema, and Levi Remijnse. 2020. Large-scale Cross-lingual Language Resources for Referencing and Framing. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3162–3171, Marseille, France. European Language Resources Association.
Cite (Informal):
Large-scale Cross-lingual Language Resources for Referencing and Framing (Vossen et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/remove-xml-comments/2020.lrec-1.387.pdf