Abstract
In this paper, we report the release of the ACoLi Dictionary Graph, a large-scale collection of multilingual open source dictionaries available in two machine-readable formats, a graph representation in RDF, using the OntoLex-Lemon vocabulary, and a simple tabular data format to facilitate their use in NLP tasks, such as translation inference across dictionaries. We describe the mapping and harmonization of the underlying data structures into a unified representation, its serialization in RDF and TSV, and the release of a massive and coherent amount of lexical data under open licenses.- Anthology ID:
- 2020.lrec-1.401
- Volume:
- Proceedings of the Twelfth Language Resources and Evaluation Conference
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Editors:
- Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 3281–3290
- Language:
- English
- URL:
- https://aclanthology.org/2020.lrec-1.401
- DOI:
- Cite (ACL):
- Christian Chiarcos, Christian Fäth, and Maxim Ionov. 2020. The ACoLi Dictionary Graph. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3281–3290, Marseille, France. European Language Resources Association.
- Cite (Informal):
- The ACoLi Dictionary Graph (Chiarcos et al., LREC 2020)
- PDF:
- https://preview.aclanthology.org/ingest-acl-2023-videos/2020.lrec-1.401.pdf