Towards Understanding Gender Bias in Relation Extraction

Andrew Gaut; Tony Sun; Shirlyn Tang; Yuxin Huang; Jing Qian; Mai ElSherief; Jieyu Zhao; Diba Mirza; Elizabeth Belding; Kai-Wei Chang; William Yang Wang

doi:10.18653/v1/2020.acl-main.265

Towards Understanding Gender Bias in Relation Extraction

Andrew Gaut, Tony Sun, Shirlyn Tang, Yuxin Huang, Jing Qian, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, William Yang Wang

Abstract

Recent developments in Neural Relation Extraction (NRE) have made significant strides towards Automated Knowledge Base Construction. While much attention has been dedicated towards improvements in accuracy, there have been no attempts in the literature to evaluate social biases exhibited in NRE systems. In this paper, we create WikiGenderBias, a distantly supervised dataset composed of over 45,000 sentences including a 10% human annotated test set for the purpose of analyzing gender bias in relation extraction systems. We find that when extracting spouse-of and hypernym (i.e., occupation) relations, an NRE system performs differently when the gender of the target entity is different. However, such disparity does not appear when extracting relations such as birthDate or birthPlace. We also analyze how existing bias mitigation techniques, such as name anonymization, word embedding debiasing, and data augmentation affect the NRE system in terms of maintaining the test performance and reducing biases. Unfortunately, due to NRE models rely heavily on surface level cues, we find that existing bias mitigation approaches have a negative effect on NRE. Our analysis lays groundwork for future quantifying and mitigating bias in NRE.

Anthology ID:: 2020.acl-main.265
Volume:: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:: July
Year:: 2020
Address:: Online
Editors:: Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2943–2953
Language:
URL:: https://aclanthology.org/2020.acl-main.265
DOI:: 10.18653/v1/2020.acl-main.265
Bibkey:
Cite (ACL):: Andrew Gaut, Tony Sun, Shirlyn Tang, Yuxin Huang, Jing Qian, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, and William Yang Wang. 2020. Towards Understanding Gender Bias in Relation Extraction. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 2943–2953, Online. Association for Computational Linguistics.
Cite (Informal):: Towards Understanding Gender Bias in Relation Extraction (Gaut et al., ACL 2020)
Copy Citation:
PDF:: https://preview.aclanthology.org/landing_page/2020.acl-main.265.pdf
Dataset:: 2020.acl-main.265.Dataset.zip
Video:: http://slideslive.com/38929244

PDF Search Dataset Video