Characterizing non-binary French: A first step towards debiasing gender inference

Marie Flesch, Heather Burnett


Abstract
This paper addresses a bias of gender inference systems: their binary nature. Based on the observation that, for French, systems based on pattern-matching of grammatical gender markers in “I am” expressions perform better than machine-learning approaches (Ciot et al. 2013), we examine the use of grammatical gender by non-binary individuals. We describe the construction of a corpus of texts produced by non-binary authors on Reddit, (formely) Twitter and three forums. Our linguistic analysis shows three main patterns of use: authors who use non-binary markers, authors who consistently use one grammatical gender, and authors who use both feminine and masculine markers. Using this knowledge, we make proposals for the improvements of existing gender inference systems based on grammatical gender.
Anthology ID:
2025.gebnlp-1.16
Volume:
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP)
Month:
August
Year:
2025
Address:
Vienna, Austria
Editors:
Agnieszka Faleńska, Christine Basta, Marta Costa-jussà, Karolina Stańczak, Debora Nozza
Venues:
GeBNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
160–170
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.gebnlp-1.16/
DOI:
Bibkey:
Cite (ACL):
Marie Flesch and Heather Burnett. 2025. Characterizing non-binary French: A first step towards debiasing gender inference. In Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pages 160–170, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Characterizing non-binary French: A first step towards debiasing gender inference (Flesch & Burnett, GeBNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.gebnlp-1.16.pdf