Nested Named Entity Recognition via Second-best Sequence Learning and Decoding

Takashi Shibuya, Eduard Hovy


Abstract
When an entity name contains other names within it, the identification of all combinations of names can become difficult and expensive. We propose a new method to recognize not only outermost named entities but also inner nested ones. We design an objective function for training a neural model that treats the tag sequence for nested entities as the second best path within the span of their parent entity. In addition, we provide the decoding method for inference that extracts entities iteratively from outermost ones to inner ones in an outside-to-inside way. Our method has no additional hyperparameters to the conditional random field based model widely used for flat named entity recognition tasks. Experiments demonstrate that our method performs better than or at least as well as existing methods capable of handling nested entities, achieving F1-scores of 85.82%, 84.34%, and 77.36% on ACE-2004, ACE-2005, and GENIA datasets, respectively.
Anthology ID:
2020.tacl-1.39
Volume:
Transactions of the Association for Computational Linguistics, Volume 8
Month:
Year:
2020
Address:
Cambridge, MA
Editors:
Mark Johnson, Brian Roark, Ani Nenkova
Venue:
TACL
SIG:
Publisher:
MIT Press
Note:
Pages:
605–620
Language:
URL:
https://aclanthology.org/2020.tacl-1.39
DOI:
10.1162/tacl_a_00334
Bibkey:
Cite (ACL):
Takashi Shibuya and Eduard Hovy. 2020. Nested Named Entity Recognition via Second-best Sequence Learning and Decoding. Transactions of the Association for Computational Linguistics, 8:605–620.
Cite (Informal):
Nested Named Entity Recognition via Second-best Sequence Learning and Decoding (Shibuya & Hovy, TACL 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl24-info/2020.tacl-1.39.pdf