Bidirectional LSTM for Named Entity Recognition in Twitter Messages

Nut Limsopatham, Nigel Collier


Abstract
In this paper, we present our approach for named entity recognition in Twitter messages that we used in our participation in the Named Entity Recognition in Twitter shared task at the COLING 2016 Workshop on Noisy User-generated text (WNUT). The main challenge that we aim to tackle in our participation is the short, noisy and colloquial nature of tweets, which makes named entity recognition in Twitter message a challenging task. In particular, we investigate an approach for dealing with this problem by enabling bidirectional long short-term memory (LSTM) to automatically learn orthographic features without requiring feature engineering. In comparison with other systems participating in the shared task, our system achieved the most effective performance on both the ‘segmentation and categorisation’ and the ‘segmentation only’ sub-tasks.
Anthology ID:
W16-3920
Volume:
Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Bo Han, Alan Ritter, Leon Derczynski, Wei Xu, Tim Baldwin
Venue:
WNUT
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
145–152
Language:
URL:
https://aclanthology.org/W16-3920
DOI:
Bibkey:
Cite (ACL):
Nut Limsopatham and Nigel Collier. 2016. Bidirectional LSTM for Named Entity Recognition in Twitter Messages. In Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT), pages 145–152, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Bidirectional LSTM for Named Entity Recognition in Twitter Messages (Limsopatham & Collier, WNUT 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ml4al-ingestion/W16-3920.pdf
Data
WNUT 2016 NER