A Study on Entity Resolution for Email Conversations

Parag Pravin Dakle, Takshak Desai, Dan Moldovan


Abstract
This paper investigates the problem of entity resolution for email conversations and presents a seed annotated corpus of email threads labeled with entity coreference chains. Characteristics of email threads concerning reference resolution are first discussed, and then the creation of the corpus and annotation steps are explained. Finally, performance of the current state-of-the-art deep learning models on the seed corpus is evaluated and qualitative error analysis on the predictions obtained is presented.
Anthology ID:
2020.lrec-1.8
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
65–73
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.8
Cite (ACL):
Parag Pravin Dakle, Takshak Desai, and Dan Moldovan. 2020. A Study on Entity Resolution for Email Conversations. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 65–73, Marseille, France. European Language Resources Association.
Cite (Informal):
A Study on Entity Resolution for Email Conversations (Dakle et al., LREC 2020)
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2020.lrec-1.8.pdf