Abstract
In this paper, we present our approach for named entity recognition in Twitter messages that we used in our participation in the Named Entity Recognition in Twitter shared task at the COLING 2016 Workshop on Noisy User-generated text (WNUT). The main challenge that we aim to tackle in our participation is the short, noisy and colloquial nature of tweets, which makes named entity recognition in Twitter message a challenging task. In particular, we investigate an approach for dealing with this problem by enabling bidirectional long short-term memory (LSTM) to automatically learn orthographic features without requiring feature engineering. In comparison with other systems participating in the shared task, our system achieved the most effective performance on both the ‘segmentation and categorisation’ and the ‘segmentation only’ sub-tasks.- Anthology ID:
- W16-3920
- Volume:
- Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
- Month:
- December
- Year:
- 2016
- Address:
- Osaka, Japan
- Editors:
- Bo Han, Alan Ritter, Leon Derczynski, Wei Xu, Tim Baldwin
- Venue:
- WNUT
- SIG:
- Publisher:
- The COLING 2016 Organizing Committee
- Note:
- Pages:
- 145–152
- Language:
- URL:
- https://aclanthology.org/W16-3920
- DOI:
- Cite (ACL):
- Nut Limsopatham and Nigel Collier. 2016. Bidirectional LSTM for Named Entity Recognition in Twitter Messages. In Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT), pages 145–152, Osaka, Japan. The COLING 2016 Organizing Committee.
- Cite (Informal):
- Bidirectional LSTM for Named Entity Recognition in Twitter Messages (Limsopatham & Collier, WNUT 2016)
- PDF:
- https://preview.aclanthology.org/ml4al-ingestion/W16-3920.pdf
- Data
- WNUT 2016 NER