Resolving Gendered Ambiguous Pronouns with BERT

Matei Ionita; Yury Kashnitsky; Ken Krige; Vladimir Larin; Atanas Atanasov; Dennis Logvinenko

doi:10.18653/v1/W19-3817

Resolving Gendered Ambiguous Pronouns with BERT

Matei Ionita, Yury Kashnitsky, Ken Krige, Vladimir Larin, Atanas Atanasov, Dennis Logvinenko

Abstract

Pronoun resolution is part of coreference resolution, the task of pairing an expression to its referring entity. This is an important task for natural language understanding and a necessary component of machine translation systems, chat bots and assistants. Neural machine learning systems perform far from ideally in this task, reaching as low as 73% F1 scores on modern benchmark datasets. Moreover, they tend to perform better for masculine pronouns than for feminine ones. Thus, the problem is both challenging and important for NLP researchers and practitioners. In this project, we describe our BERT-based approach to solving the problem of gender-balanced pronoun resolution. We are able to reach 92% F1 score and a much lower gender bias on the benchmark dataset shared by Google AI Language team.

Anthology ID:: W19-3817
Volume:: Proceedings of the First Workshop on Gender Bias in Natural Language Processing
Month:: August
Year:: 2019
Address:: Florence, Italy
Venue:: GeBNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 113–119
Language:
URL:: https://aclanthology.org/W19-3817
DOI:: 10.18653/v1/W19-3817
Bibkey:
Cite (ACL):: Matei Ionita, Yury Kashnitsky, Ken Krige, Vladimir Larin, Atanas Atanasov, and Dennis Logvinenko. 2019. Resolving Gendered Ambiguous Pronouns with BERT. In Proceedings of the First Workshop on Gender Bias in Natural Language Processing, pages 113–119, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):: Resolving Gendered Ambiguous Pronouns with BERT (Ionita et al., GeBNLP 2019)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingestion-script-update/W19-3817.pdf
Code: Yorko/gender-unbiased_BERT-based_pronoun_resolution
Data: GAP Coreference Dataset

PDF Search Code