Abstract
Methods for Named Entity Recognition and Disambiguation (NERD) perform NER and NED in two separate stages. Therefore, NED may be penalized with respect to precision by NER false positives, and suffers in recall from NER false negatives. Conversely, NED does not fully exploit information computed by NER such as types of mentions. This paper presents J-NERD, a new approach to perform NER and NED jointly, by means of a probabilistic graphical model that captures mention spans, mention types, and the mapping of mentions to entities in a knowledge base. We present experiments with different kinds of texts from the CoNLL’03, ACE’05, and ClueWeb’09-FACC1 corpora. J-NERD consistently outperforms state-of-the-art competitors in end-to-end NERD precision, recall, and F1.- Anthology ID:
- Q16-1016
- Volume:
- Transactions of the Association for Computational Linguistics, Volume 4
- Month:
- Year:
- 2016
- Address:
- Cambridge, MA
- Venue:
- TACL
- SIG:
- Publisher:
- MIT Press
- Note:
- Pages:
- 215–229
- Language:
- URL:
- https://aclanthology.org/Q16-1016
- DOI:
- 10.1162/tacl_a_00094
- Cite (ACL):
- Dat Ba Nguyen, Martin Theobald, and Gerhard Weikum. 2016. J-NERD: Joint Named Entity Recognition and Disambiguation with Rich Linguistic Features. Transactions of the Association for Computational Linguistics, 4:215–229.
- Cite (Informal):
- J-NERD: Joint Named Entity Recognition and Disambiguation with Rich Linguistic Features (Nguyen et al., TACL 2016)
- PDF:
- https://preview.aclanthology.org/paclic-22-ingestion/Q16-1016.pdf