OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian
Ranka Stanković, Maxim Ionov, Medina Bajtarević, Lorena Ninčević
Abstract
This paper introduces a novel language resource for retrieving and researching verbal aspectual pairs in BCS (Bosnian, Croatian, and Serbian) created using Linguistic Linked Open Data (LLOD) principles. As there is no resource to help learners of Bosnian, Croatian, and Serbian as foreign languages to recognize the aspect of a verb or its pairs, we have created a new resource that will provide users with information about the aspect, as well as the link to a verb’s aspectual counterparts. This resource also contains external links to monolingual dictionaries, Wordnet, and BabelNet. As this is a work in progress, our resource only includes verbs and their perfective pairs formed with prefixes “pro”, “od”, “ot”, “iz”, “is” and “na”. The goal of this project is to have a complete dataset of all the aspectual pairs in these three languages. We believe it will be useful for research in the field of aspectology, as well as machine translation and other NLP tasks. Using this resource as an example, we also propose a sustainable approach to publishing small to moderate LLOD resources on the Web, both in a user-friendly way and according to the Linked Data principles.
- Anthology ID:
- 2024.ldl-1.14
- Volume:
- Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024
- Month:
- May
- Year:
- 2024
- Address:
- Torino, Italia
- Editors:
- Christian Chiarcos, Katerina Gkirtzou, Maxim Ionov, Fahad Khan, John P. McCrae, Elena Montiel Ponsoda, Patricia Martín Chozas
- Venues:
- LDL | WS
- Publisher:
- ELRA and ICCL
- Pages:
- 108–114
- URL:
- https://aclanthology.org/2024.ldl-1.14
- Cite (ACL):
- Ranka Stanković, Maxim Ionov, Medina Bajtarević, and Lorena Ninčević. 2024. OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian. In Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, pages 108–114, Torino, Italia. ELRA and ICCL.
- Cite (Informal):
- OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian (Stanković et al., LDL-WS 2024)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2024.ldl-1.14.pdf