Resolving Gendered Ambiguous Pronouns with BERT
Matei Ionita, Yury Kashnitsky, Ken Krige, Vladimir Larin, Atanas Atanasov, Dennis Logvinenko
Abstract
Pronoun resolution is part of coreference resolution, the task of pairing an expression to its referring entity. This is an important task for natural language understanding and a necessary component of machine translation systems, chat bots and assistants. Neural machine learning systems perform far from ideally in this task, reaching as low as 73% F1 scores on modern benchmark datasets. Moreover, they tend to perform better for masculine pronouns than for feminine ones. Thus, the problem is both challenging and important for NLP researchers and practitioners. In this project, we describe our BERT-based approach to solving the problem of gender-balanced pronoun resolution. We are able to reach 92% F1 score and a much lower gender bias on the benchmark dataset shared by Google AI Language team.- Anthology ID:
- W19-3817
- Volume:
- Proceedings of the First Workshop on Gender Bias in Natural Language Processing
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Venue:
- GeBNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 113–119
- Language:
- URL:
- https://aclanthology.org/W19-3817
- DOI:
- 10.18653/v1/W19-3817
- Cite (ACL):
- Matei Ionita, Yury Kashnitsky, Ken Krige, Vladimir Larin, Atanas Atanasov, and Dennis Logvinenko. 2019. Resolving Gendered Ambiguous Pronouns with BERT. In Proceedings of the First Workshop on Gender Bias in Natural Language Processing, pages 113–119, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Resolving Gendered Ambiguous Pronouns with BERT (Ionita et al., GeBNLP 2019)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W19-3817.pdf
- Code
- Yorko/gender-unbiased_BERT-based_pronoun_resolution
- Data
- GAP Coreference Dataset