@inproceedings{harris-etal-2024-modeling,
title = "Modeling Gender and Dialect Bias in Automatic Speech Recognition",
author = "Harris, Camille and
Mgbahurike, Chijioke and
Kumar, Neha and
Yang, Diyi",
editor = "Al-Onaizan, Yaser and
Bansal, Mohit and
Chen, Yun-Nung",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2024",
month = nov,
year = "2024",
address = "Miami, Florida, USA",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/fix-sig-urls/2024.findings-emnlp.890/",
doi = "10.18653/v1/2024.findings-emnlp.890",
pages = "15166--15184",
abstract = "Dialect and gender-based biases have become an area of concern in language-dependent AI systemsincluding around automatic speech recognition (ASR) which processes speech audio into text. These potential biases raise concern for discriminatory outcomes with AI systems depending on demographic- particularly gender discrimination against women, and racial discrimination against minorities with ethnic or cultural English dialects.As such we aim to evaluate the performance of ASR systems across different genders and across dialects of English. Concretely, we take a deep dive of the performance of ASR systems on men and women across four US-based English dialects: Standard American English (SAE), African American Vernacular English (AAVE), Chicano English, and Spanglish. To do this, we construct a labeled dataset of 13 hours of podcast audio, transcribed by speakers of the represented dialects. We then evaluate zero-shot performance of different automatic speech recognition models on our dataset, and further finetune models to better understand how finetuning can impact performance. Our work fills the gap of investigating possible gender disparities within underrepresented dialects."
}