Dhanalakshmi V

2026

We investigate the role of large language models (LLMs) in promoting gender-inclusive language by evaluating their ability to rewrite biased text and generate counterfactual narratives across multiple languages. We introduce a shared task with two subtasks: gender-inclusive rewriting and counterfactual generation. The task covers five languages English, German, Spanish, Tamil, and Kannada reflecting diverse grammatical gender systems and sociocultural contexts. We release curated word-level and sentence-level datasets to support controlled inclusive generation. A total of 50 teams registered for the shared task, and around 8 teams submitted results. Submissions are evaluated using a hybrid framework combining rubric-based automatic scoring with expert human judgment. Finally, we provide an overview of participating systems and discuss key findings and challenges observed across languages.

Co-authors

Shunmuga Priya Muthusamy Chinnan 1

Miguel Ángel García-Cumbreras 1

Sylvia Jaki 1

Salud María Jiménez-Zafra 1

Anand Kumar M 1

Thomas Mandl 1

Rahul Ponnusamy 1

Sathiyaraj Thangasamy 1

Venues

LTEDI1
WS1

Fix author