Abstract
This paper presents RELATE (http://relate.racai.ro), a high-performance natural language platform designed for the Romanian language. It is meant both for demonstrating the available services, from text-span annotations to syntactic dependency trees, as well as playing or automatically synthesizing Romanian words, and for developing new annotated corpora. It also incorporates the search engines for the large COROLA reference corpus of contemporary Romanian and for the Romanian wordnet. It integrates multiple text and speech processing modules and exposes their functionality through a web interface designed for the linguist researcher. It uses a scheduler-runner architecture, which allows processing to be distributed across multiple computing nodes. A series of input/output converters allows large corpora to be loaded, processed and exported according to user preferences.
- Anthology ID:
- 2020.iwltp-1.13
- Volume:
- Proceedings of the 1st International Workshop on Language Technology Platforms
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Editors:
- Georg Rehm, Kalina Bontcheva, Khalid Choukri, Jan Hajič, Stelios Piperidis, Andrejs Vasiļjevs
- Venue:
- IWLTP
- Publisher:
- European Language Resources Association
- Pages:
- 81–88
- Language:
- English
- URL:
- https://aclanthology.org/2020.iwltp-1.13
- Cite (ACL):
- Vasile Păiș, Radu Ion, and Dan Tufiș. 2020. A Processing Platform Relating Data and Tools for Romanian Language. In Proceedings of the 1st International Workshop on Language Technology Platforms, pages 81–88, Marseille, France. European Language Resources Association.
- Cite (Informal):
- A Processing Platform Relating Data and Tools for Romanian Language (Păiș et al., IWLTP 2020)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/2020.iwltp-1.13.pdf