Abstract
Entity disambiguation with Wikipedia relies on structured information from redirect pages, article text, inter-article links, and categories. We explore whether web links can replace a curated encyclopaedia, obtaining entity prior, name, context, and coherence models from a corpus of web pages with links to Wikipedia. Experiments compare web link models to Wikipedia models on well-known conll and tac data sets. Results show that using 34 million web links approaches Wikipedia performance. Combining web link and Wikipedia models produces the best-known disambiguation accuracy of 88.7 on standard newswire test data.- Anthology ID:
- Q15-1011
- Volume:
- Transactions of the Association for Computational Linguistics, Volume 3
- Month:
- Year:
- 2015
- Address:
- Cambridge, MA
- Venue:
- TACL
- SIG:
- Publisher:
- MIT Press
- Note:
- Pages:
- 145–156
- Language:
- URL:
- https://aclanthology.org/Q15-1011
- DOI:
- 10.1162/tacl_a_00129
- Cite (ACL):
- Andrew Chisholm and Ben Hachey. 2015. Entity Disambiguation with Web Links. Transactions of the Association for Computational Linguistics, 3:145–156.
- Cite (Informal):
- Entity Disambiguation with Web Links (Chisholm & Hachey, TACL 2015)
- PDF:
- https://preview.aclanthology.org/author-url/Q15-1011.pdf
- Code
- wikilinks/nel