An Annotated Corpus of Adjective-Adverb Interfaces in Romance Languages

Katharina Gerhalter, Gerlinde Schneider, Christopher Pollin, Martin Hummel


Abstract
The final outcome of the project Open Access Database: Adjective-Adverb Interfaces in Romance is an annotated and lemmatised corpus of various linguistic phenomena related to Romance adjectives with adverbial functions. The data is published under open-access and aims to serve linguistic research based on transparent and accessible corpus-based data. The annotation model was developed to offer a cross-linguistic categorization model for the heterogeneous word-class “adverb”, based on its diverse forms, functions and meanings. The project focuses on the interoperability and accessibility of data, with particular respect to reusability in the sense of the FAIR Data Principles. Topics presented by this paper include data compilation and creation, annotation in XML/TEI, data preservation and publication process by means of the GAMS repository and accessibility via a search interface. These aspects are tied together by semantic technologies, using an ontology-based approach.
Anthology ID:
2020.lrec-1.120
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
953–957
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.120
DOI:
Bibkey:
Cite (ACL):
Katharina Gerhalter, Gerlinde Schneider, Christopher Pollin, and Martin Hummel. 2020. An Annotated Corpus of Adjective-Adverb Interfaces in Romance Languages. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 953–957, Marseille, France. European Language Resources Association.
Cite (Informal):
An Annotated Corpus of Adjective-Adverb Interfaces in Romance Languages (Gerhalter et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2020.lrec-1.120.pdf