Hedwig: A Named Entity Linker

Marcus Klang, Pierre Nugues


Abstract
Named entity linking is the task of identifying mentions of named things in text, such as “Barack Obama” or “New York”, and linking these mentions to unique identifiers. In this paper, we describe Hedwig, an end-to-end named entity linker, which uses a combination of word and character BILSTM models for mention detection, a Wikidata and Wikipedia-derived knowledge base with global information aggregated over nine language editions, and a PageRank algorithm for entity linking. We evaluated Hedwig on the TAC2017 dataset, consisting of news texts and discussion forums, and we obtained a final score of 59.9% on CEAFmC+, an improvement over our previous generation linker Ugglan, and a trilingual entity link score of 71.9%.
Anthology ID:
2020.lrec-1.554
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
4501–4508
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.554
DOI:
Bibkey:
Cite (ACL):
Marcus Klang and Pierre Nugues. 2020. Hedwig: A Named Entity Linker. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 4501–4508, Marseille, France. European Language Resources Association.
Cite (Informal):
Hedwig: A Named Entity Linker (Klang & Nugues, LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-2/2020.lrec-1.554.pdf