The EuroVoc Thesaurus: Management, Applications, and Future Directions

Lucy Walhain, Sébastien Albouze, Anikó Gerencsér, Mihai Paunescu, Vassilis Tzouvaras, Cosimo Palma


Abstract
This paper provides a comprehensive overview of EuroVoc, the European Union’s multilingual thesaurus. The paper highlights EuroVoc’s significance in the legislative and publications domain, examining its applications in improving information retrieval systems and multi-label text classification methods. Various technological tools developed specifically for EuroVoc classification, including JEX, PyEuroVoc, and KEVLAR, are reviewed, demonstrating the evolution from basic classification systems to sophisticated neural architectures. Additionally, the paper addresses the management practices governing EuroVoc’s continuous updating and expansion through collaborative tools such as VocBench, emphasising the role of interinstitutional committees and specialised teams in maintaining the thesaurus’s accuracy and relevance. A substantial part of the paper is dedicated to EuroVoc’s alignment with other semantic resources like Wikidata and UNESCO, detailing the challenges and methodologies adopted to facilitate semantic interoperability across diverse information systems. Finally, the paper identifies future directions that include modular extensions of EuroVoc, federated models, linked data approaches, thematic hubs, selective integration, and collaborative governance frameworks.
Anthology ID:
2025.ldk-1.34
Volume:
Proceedings of the 5th Conference on Language, Data and Knowledge
Month:
September
Year:
2025
Address:
Naples, Italy
Editors:
Mehwish Alam, Andon Tchechmedjiev, Jorge Gracia, Dagmar Gromann, Maria Pia di Buono, Johanna Monti, Maxim Ionov
Venues:
LDK | WS
Publisher:
Unior Press
Pages:
340–350
URL:
https://preview.aclanthology.org/ldl-25-ingestion/2025.ldk-1.34/
Cite (ACL):
Lucy Walhain, Sébastien Albouze, Anikó Gerencsér, Mihai Paunescu, Vassilis Tzouvaras, and Cosimo Palma. 2025. The EuroVoc Thesaurus: Management, Applications, and Future Directions. In Proceedings of the 5th Conference on Language, Data and Knowledge, pages 340–350, Naples, Italy. Unior Press.
Cite (Informal):
The EuroVoc Thesaurus: Management, Applications, and Future Directions (Walhain et al., LDK 2025)
PDF:
https://preview.aclanthology.org/ldl-25-ingestion/2025.ldk-1.34.pdf