Can Explicit Gender Information Improve Zero-Shot Machine Translation?

Van-Hien Tran, Huy Hien Vu, Hideki Tanaka, Masao Utiyama


Abstract
Large language models (LLMs) have demonstrated strong zero-shot machine translation (MT) performance but often exhibit gender bias that is present in their training data, especially when translating into grammatically gendered languages. In this paper, we investigate whether explicitly providing gender information can mitigate this issue and improve translation quality. We propose a two-step approach: (1) inferring entity gender from context, and (2) incorporating this information into prompts using either Structured Tagging or Natural Language. Experiments with five LLMs across four language pairs show that explicit gender cues consistently reduce gender errors, with structured tagging yielding the largest gains. Our results highlight prompt-level gender disambiguation as a simple yet effective strategy for more accurate and fair zero-shot MT.
Anthology ID:
2025.gebnlp-1.17
Volume:
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP)
Month:
August
Year:
2025
Address:
Vienna, Austria
Editors:
Agnieszka Faleńska, Christine Basta, Marta Costa-jussà, Karolina Stańczak, Debora Nozza
Venues:
GeBNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
171–181
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.gebnlp-1.17/
DOI:
Bibkey:
Cite (ACL):
Van-Hien Tran, Huy Hien Vu, Hideki Tanaka, and Masao Utiyama. 2025. Can Explicit Gender Information Improve Zero-Shot Machine Translation?. In Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pages 171–181, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Can Explicit Gender Information Improve Zero-Shot Machine Translation? (Tran et al., GeBNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.gebnlp-1.17.pdf