ASU: An Experimental Study on Applying Deep Learning in Twitter Named Entity Recognition.

Michel Naim Gerguis, Cherif Salama, M. Watheq El-Kharashi


Abstract
This paper describes the ASU system submitted in the COLING W-NUT 2016 Twitter Named Entity Recognition (NER) task. We present an experimental study on applying deep learning to extracting named entities (NEs) from tweets. We built two Long Short-Term Memory (LSTM) models for the task. The first model was built to extract named entities without types while the second model was built to extract and then classify them into 10 fine-grained entity classes. In this effort, we show detailed experimentation results on the effectiveness of word embeddings, brown clusters, part-of-speech (POS) tags, shape features, gazetteers, and local context for the tweet input vector representation to the LSTM model. Also, we present a set of experiments, to better design the network parameters for the Twitter NER task. Our system was ranked the fifth out of ten participants with a final f1-score for the typed classes of 39% and 55% for the non typed ones.
Anthology ID:
W16-3925
Volume:
Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
WNUT | WS
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
188–196
Language:
URL:
https://aclanthology.org/W16-3925
DOI:
Bibkey:
Cite (ACL):
Michel Naim Gerguis, Cherif Salama, and M. Watheq El-Kharashi. 2016. ASU: An Experimental Study on Applying Deep Learning in Twitter Named Entity Recognition.. In Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT), pages 188–196, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
ASU: An Experimental Study on Applying Deep Learning in Twitter Named Entity Recognition. (Gerguis et al., 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/update-css-js/W16-3925.pdf
Data
WNUT 2016 NER