Fine-grained Typing of Emerging Entities in Microblogs

Satoshi Akasaki, Naoki Yoshinaga, Masashi Toyoda


Abstract
Analyzing microblogs, where users post what they experience, enables various applications such as social-trend analysis and entity recommendation. To track emerging trends in a variety of areas, we want to categorize information on emerging entities (e.g., Avatar 2) in microblog posts according to their types (e.g., Film). We thus introduce a new entity typing task that assigns a fine-grained type to each emerging entity when a burst of posts containing that entity is first observed in a microblog. The challenge is to perform typing from noisy microblog posts without relying on prior knowledge of the target entity. To tackle this task, we build large-scale Twitter datasets for English and Japanese using time-sensitive distant supervision. We then propose a modular neural typing model that encodes not only the entity and its contexts but also meta information in multiple posts. To type ‘homographic’ emerging entities (e.g., ‘Go’ can refer to both an emerging programming language and a classic board game), whose contexts are noisy, we devise a context selector that finds related contexts of the target entity. Experiments on the Twitter datasets confirm the effectiveness of our typing model and the context selector.
Anthology ID:
2021.findings-emnlp.399
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2021
Month:
November
Year:
2021
Address:
Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
Findings
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Pages:
4667–4679
URL:
https://aclanthology.org/2021.findings-emnlp.399
DOI:
10.18653/v1/2021.findings-emnlp.399
Cite (ACL):
Satoshi Akasaki, Naoki Yoshinaga, and Masashi Toyoda. 2021. Fine-grained Typing of Emerging Entities in Microblogs. In Findings of the Association for Computational Linguistics: EMNLP 2021, pages 4667–4679, Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
Fine-grained Typing of Emerging Entities in Microblogs (Akasaki et al., Findings 2021)
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2021.findings-emnlp.399.pdf
Software:
 2021.findings-emnlp.399.Software.zip
Video:
 https://preview.aclanthology.org/emnlp-22-attachments/2021.findings-emnlp.399.mp4
Data
DBpedia