Avijit Ghosh
2026
Position: Evaluations of AI Moral Reasoning Still Miss Half of the Picture
Aidan Kierans | Ritam Dutt | Kaley Rittichier | Shiri Dori-Hacohen | Avijit Ghosh
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
Aidan Kierans | Ritam Dutt | Kaley Rittichier | Shiri Dori-Hacohen | Avijit Ghosh
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
Recent work on evaluating the moral competence of large language models (LLMs) has focused primarily on what we call the moral value problem, i.e., whether model outputs align with human moral values. In contrast, the moral norm problem, i.e., whether models can identify and correctly apply context-sensitive moral norms, remains underexplored. We posit that this imbalance stems from the field’s reliance on descriptive ethics frameworks, such as Moral Foundations Theory and Kohlberg’s stages of moral development, which emphasize value representation over normative application. We review existing benchmarks and evaluation methods, and show that they cluster heavily around the value problem, while discussion regarding normative ethics remains underrepresented. We identify three crucial gaps: (i) the absence of high-quality groundtruth data for moral norms and their applications, (ii) insufficient evaluation of intermediate reasoning processes, and (iii) limited attention to the identification of morally relevant features in context. Subsequently, we propose a research agenda that includes the development of standardized formal representations for normative theories, the construction of expert-annotated datasets capturing norm application, and evaluation protocols that explicitly distinguish between values-level and normslevel competence. Our goal is to encourage a more systematic study of normative reasoning in LLMs.
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
Mubashara Akhtar | Jan Batzner | Leshem Choshen | Avijit Ghosh | Usman Gohar | Jennifer Mickel | Ichhya Pant | Zeerak Talat | Michelle Lin
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
Mubashara Akhtar | Jan Batzner | Leshem Choshen | Avijit Ghosh | Usman Gohar | Jennifer Mickel | Ichhya Pant | Zeerak Talat | Michelle Lin
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
2025
SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models
Margaret Mitchell | Giuseppe Attanasio | Ioana Baldini | Miruna Clinciu | Jordan Clive | Pieter Delobelle | Manan Dey | Sil Hamilton | Timm Dill | Jad Doughman | Ritam Dutt | Avijit Ghosh | Jessica Zosa Forde | Carolin Holtermann | Lucie-Aimée Kaffee | Tanmay Laud | Anne Lauscher | Roberto L Lopez-Davila | Maraim Masoud | Nikita Nangia | Anaelia Ovalle | Giada Pistilli | Dragomir Radev | Beatrice Savoldi | Vipul Raheja | Jeremy Qin | Esther Ploeger | Arjun Subramonian | Kaustubh Dhole | Kaiser Sun | Amirbek Djanibekov | Jonibek Mansurov | Kayo Yin | Emilio Villa Cueva | Sagnik Mukherjee | Jerry Huang | Xudong Shen | Jay Gala | Hamdan Al-Ali | Tair Djanibekov | Nurdaulet Mukhituly | Shangrui Nie | Shanya Sharma | Karolina Stanczak | Eliza Szczechla | Tiago Timponi Torrent | Deepak Tunuguntla | Marcelo Viridiano | Oskar Van Der Wal | Adina Yakefu | Aurélie Névéol | Mike Zhang | Sydney Zink | Zeerak Talat
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Margaret Mitchell | Giuseppe Attanasio | Ioana Baldini | Miruna Clinciu | Jordan Clive | Pieter Delobelle | Manan Dey | Sil Hamilton | Timm Dill | Jad Doughman | Ritam Dutt | Avijit Ghosh | Jessica Zosa Forde | Carolin Holtermann | Lucie-Aimée Kaffee | Tanmay Laud | Anne Lauscher | Roberto L Lopez-Davila | Maraim Masoud | Nikita Nangia | Anaelia Ovalle | Giada Pistilli | Dragomir Radev | Beatrice Savoldi | Vipul Raheja | Jeremy Qin | Esther Ploeger | Arjun Subramonian | Kaustubh Dhole | Kaiser Sun | Amirbek Djanibekov | Jonibek Mansurov | Kayo Yin | Emilio Villa Cueva | Sagnik Mukherjee | Jerry Huang | Xudong Shen | Jay Gala | Hamdan Al-Ali | Tair Djanibekov | Nurdaulet Mukhituly | Shangrui Nie | Shanya Sharma | Karolina Stanczak | Eliza Szczechla | Tiago Timponi Torrent | Deepak Tunuguntla | Marcelo Viridiano | Oskar Van Der Wal | Adina Yakefu | Aurélie Névéol | Mike Zhang | Sydney Zink | Zeerak Talat
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Large Language Models (LLMs) reproduce and exacerbate the social biases present in their training data, and resources to quantify this issue are limited. While research has attempted to identify and mitigate such biases, most efforts have been concentrated around English, lagging the rapid advancement of LLMs in multilingual settings. In this paper, we introduce a new multilingual parallel dataset SHADES to help address this issue, designed for examining culturally-specific stereotypes that may be learned by LLMs. The dataset includes stereotypes from 20 regions around the world and 16 languages, spanning multiple identity categories subject to discrimination worldwide. We demonstrate its utility in a series of exploratory evaluations for both “base” and “instruction-tuned” language models. Our results suggest that stereotypes are consistently reflected across models and languages, with some languages and models indicating much stronger stereotype biases than others.
Search
Fix author
Co-authors
- Ritam Dutt 2
- Mubashara Akhtar 1
- Hamdan Al-Ali 1
- Giuseppe Attanasio 1
- Ioana Baldini 1
- Jan Batzner 1
- Leshem Choshen 1
- Miruna Clinciu 1
- Jordan Clive 1
- Pieter Delobelle 1
- Manan Dey 1
- Kaustubh Dhole 1
- Timm Dill 1
- Amirbek Djanibekov 1
- Shiri Dori-Hacohen 1
- Jad Doughman 1
- Jessica Zosa Forde 1
- Jay Gala 1
- Usman Gohar 1
- Sil Hamilton 1
- Carolin Holtermann 1
- Jerry Huang 1
- Lucie-Aimée Kaffee 1
- Aidan Kierans 1
- Tanmay Laud 1
- Anne Lauscher 1
- Michelle Lin 1
- Roberto L Lopez-Davila 1
- Jonibek Mansurov 1
- Maraim Masoud 1
- Jennifer Mickel 1
- Margaret Mitchell 1
- Sagnik Mukherjee 1
- Nurdaulet Mukhituly 1
- Nikita Nangia 1
- Aurelie Neveol 1
- Shangrui Nie 1
- Anaelia Ovalle 1
- Ichhya Pant 1
- Giada Pistilli 1
- Esther Ploeger 1
- Jeremy Qin 1
- Dragomir Radev 1
- Vipul Raheja 1
- Kaley Rittichier 1
- Beatrice Savoldi 1
- Shanya Sharma 1
- Xudong Shen 1
- Karolina Stanczak 1
- Arjun Subramonian 1
- Kaiser Sun 1
- Eliza Szczechla 1
- Tair Djanibekov 1
- Zeerak Talat 1
- Zeerak Talat 1
- Tiago Timponi Torrent 1
- Deepak Tunuguntla 1
- Oskar Van Der Wal 1
- Emilio Villa-Cueva 1
- Marcelo Viridiano 1
- Adina Yakefu 1
- Kayo Yin 1
- Mike Zhang 1
- Sydney Zink 1