Tuning Into Bias: A Computational Study of Gender Bias in Song Lyrics

Danqing Chen, Adithi Satish, Rasul Khanbayov, Carolin Schuster, Georg Groh


Abstract
The application of text mining methods is becoming increasingly prevalent, particularly within Humanities and Computational Social Sciences, as well as in a broader range of disciplines. This paper presents an analysis of gender bias in English song lyrics using topic modeling and bias measurement techniques. Leveraging BERTopic, we cluster a dataset of 537,553 English songs into distinct topics and analyze their temporal evolution. Our results reveal a significant thematic shift in song lyrics over time, transitioning from romantic themes to a heightened focus on the sexualization of women. Additionally, we observe a substantial prevalence of profanity and misogynistic content across various topics, with a particularly high concentration in the largest thematic cluster. To further analyse gender bias across topics and genres in a quantitative way, we employ the Single Category Word Embedding Association Test (SC-WEAT) to calculate bias scores for word embeddings trained on the most prominent topics as well as individual genres. The results indicate a consistent male bias in words associated with intelligence and strength, while appearance and weakness words show a female bias. Further analysis highlights variations in these biases across topics, illustrating the interplay between thematic content and gender stereotypes in song lyrics.
Anthology ID:
2025.latechclfl-1.12
Volume:
Proceedings of the 9th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2025)
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Anna Kazantseva, Stan Szpakowicz, Stefania Degaetano-Ortlieb, Yuri Bizzoni, Janis Pagel
Venues:
LaTeCHCLfL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
117–129
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.latechclfl-1.12/
DOI:
Bibkey:
Cite (ACL):
Danqing Chen, Adithi Satish, Rasul Khanbayov, Carolin Schuster, and Georg Groh. 2025. Tuning Into Bias: A Computational Study of Gender Bias in Song Lyrics. In Proceedings of the 9th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2025), pages 117–129, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Tuning Into Bias: A Computational Study of Gender Bias in Song Lyrics (Chen et al., LaTeCHCLfL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.latechclfl-1.12.pdf