Combining Deep Learning Models and Lexical Linked Data: Some Insights from the Development of a Multilingual News Named Entity Recognition and Linking Dataset

Emmanuel Cartier, Emile Peetermans


Abstract
This paper presents the methodology and outcomes of a Named Entity Recognition and Linking multilingual news benchmark that leverages both Deep learning approaches by using a fine-tuned transformer model to detect mentions of persons, locations and organisations in text, and Linguistic Linked Open Data, through the use of Wikidata to disambiguate mentions and link them to ontology entries. It shows all the advantages of combining both approaches, not only for building the benchmark but also for fine-tuning detection models. We also insist on several perspectives of research to improve the accuracy of a combining system and go further on leveraging the complementary approaches.
Anthology ID:
2024.dlnld-1.3
Volume:
Proceedings of the Workshop on Deep Learning and Linked Data (DLnLD) @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Gilles Sérasset, Hugo Gonçalo Oliveira, Giedre Valunaite Oleskeviciene
Venues:
DLnLD | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
31–44
Language:
URL:
https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2024.dlnld-1.3/
DOI:
Bibkey:
Cite (ACL):
Emmanuel Cartier and Emile Peetermans. 2024. Combining Deep Learning Models and Lexical Linked Data: Some Insights from the Development of a Multilingual News Named Entity Recognition and Linking Dataset. In Proceedings of the Workshop on Deep Learning and Linked Data (DLnLD) @ LREC-COLING 2024, pages 31–44, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Combining Deep Learning Models and Lexical Linked Data: Some Insights from the Development of a Multilingual News Named Entity Recognition and Linking Dataset (Cartier & Peetermans, DLnLD 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2024.dlnld-1.3.pdf