Cosimo Palma
2025
The EuroVoc Thesaurus: Management, Applications, and Future Directions
Lucy Walhain
|
Sébastien Albouze
|
Anikó Gerencsér
|
Mihai Paunescu
|
Vassilis Tzouvaras
|
Cosimo Palma
Proceedings of the 5th Conference on Language, Data and Knowledge
40 This paper provides a comprehensive overview of EuroVoc, the European Union’s multilingual thesaurus. The paper highlights EuroVoc’s significance in the legislative and publications domain, examining its applications in improving information retrieval systems and multi-label text classification methods. Various technological tools developed specifically for EuroVoc classification, including JEX, PyEuroVoc, and KEVLAR, are reviewed, demonstrating the evolution from basic classification systems to sophisticated neural architectures. Additionally, the paper addresses the management practices managing EuroVoc’s continuous updating and expansion through collaborative tools such as VocBench, emphasising the role of interinstitutional committees and specialised teams in maintaining the thesaurus’s accuracy and relevance.A substantial part of the paper is dedicated to EuroVoc’s alignment with other semantic resources like Wikidata and UNESCO, detailing the challenges and methodologies adopted to facilitate semantic interoperability across diverse information systems. Finally, the paper identifies future directions that include modular extensions of EuroVoc, federated models, linked data approaches, thematic hubs, selective integration, and collaborative governance frameworks.
2024
From Linguistic Linked Data to Big Data
Dimitar Trajanov
|
Elena Apostol
|
Radovan Garabik
|
Katerina Gkirtzou
|
Dagmar Gromann
|
Chaya Liebeskind
|
Cosimo Palma
|
Michael Rosner
|
Alexia Sampri
|
Gilles Sérasset
|
Blerina Spahiu
|
Ciprian-Octavian Truică
|
Giedre Valunaite Oleskeviciene
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
With advances in the field of Linked (Open) Data (LOD), language data on the LOD cloud has grown in number, size, and variety. With an increased volume and variety of language data, optimizations of methods for distributing, storing, and querying these data become more central. To this end, this position paper investigates use cases at the intersection of LLOD and Big Data, existing approaches to utilizing Big Data techniques within the context of linked data, and discusses the challenges and benefits of this union.
Search
Fix author
Co-authors
- Sébastien Albouze 1
- Elena Apostol 1
- Radovan Garabík 1
- Anikó Gerencsér 1
- Katerina Gkirtzou 1
- show all...