Improving Entity Linking by Encoding Type Information into Entity Embeddings

Li Tianran, Yang Erguang, Zhang Yujie, Chen Yufeng, Xu Jinan


Abstract
Entity Linking (EL) refers to the task of linking entity mentions in the text to the correct entities inthe Knowledge Base (KB) in which entity embeddings play a vital and challenging role because of the subtle differences between entities. However existing pre-trained entity embeddings onlylearn the underlying semantic information in texts yet the fine-grained entity type informationis ignored which causes the type of the linked entity is incompatible with the mention context.In order to solve this problem we propose to encode fine-grained type information into entity embeddings. We firstly pre-train word vectors to inject type information by embedding wordsand fine-grained entity types into the same vector space. Then we retrain entity embeddings withword vectors containing fine-grained type information. By applying our entity embeddings to twoexisting EL models our method respectively achieves 0.82% and 0.42% improvement on average F1 score of the test sets. Meanwhile our method is model-irrelevant which means it can helpother EL models.
Anthology ID:
2021.ccl-1.97
Volume:
Proceedings of the 20th Chinese National Conference on Computational Linguistics
Month:
August
Year:
2021
Address:
Huhhot, China
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
1087–1095
Language:
English
URL:
https://aclanthology.org/2021.ccl-1.97
DOI:
Bibkey:
Cite (ACL):
Li Tianran, Yang Erguang, Zhang Yujie, Chen Yufeng, and Xu Jinan. 2021. Improving Entity Linking by Encoding Type Information into Entity Embeddings. In Proceedings of the 20th Chinese National Conference on Computational Linguistics, pages 1087–1095, Huhhot, China. Chinese Information Processing Society of China.
Cite (Informal):
Improving Entity Linking by Encoding Type Information into Entity Embeddings (Tianran et al., CCL 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2021.ccl-1.97.pdf
Data
FIGER