WoNBias: A Dataset for Classifying Bias & Prejudice Against Women in Bengali Text

Md. Raisul Islam Aupi, Nishat Tafannum, Md. Shahidur Rahman, Kh Mahmudul Hassan, Naimur Rahman


Abstract
This paper presents WoNBias, a curated Bengali dataset to identify gender-based biases, stereotypes, and harmful language directed at women. It merges digital sources- social media, blogs, news- with offline tactics comprising surveys and focus groups, alongside some existing corpora to compile a total of 31,484 entries (10,656 negative; 10,170 positive; 10,658 neutral). WoNBias reflects the sociocultural subtleties of bias in both Bengali digital and offline conversations. By bridging online and offline biased contexts, the dataset supports content moderation, policy interventions, and equitable NLP research for Bengali, a low-resource language critically underserved by existing tools. WoNBias aims to combat systemic gender discrimination against women on digital platforms, empowering researchers and practitioners to combat harmful narratives in Bengali-speaking communities.
Anthology ID:
2025.gebnlp-1.10
Volume:
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP)
Month:
August
Year:
2025
Address:
Vienna, Austria
Editors:
Agnieszka Faleńska, Christine Basta, Marta Costa-jussà, Karolina Stańczak, Debora Nozza
Venues:
GeBNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
105–110
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.gebnlp-1.10/
DOI:
10.18653/v1/2025.gebnlp-1.10
Bibkey:
Cite (ACL):
Md. Raisul Islam Aupi, Nishat Tafannum, Md. Shahidur Rahman, Kh Mahmudul Hassan, and Naimur Rahman. 2025. WoNBias: A Dataset for Classifying Bias & Prejudice Against Women in Bengali Text. In Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pages 105–110, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
WoNBias: A Dataset for Classifying Bias & Prejudice Against Women in Bengali Text (Aupi et al., GeBNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.gebnlp-1.10.pdf