Position: Evaluations of AI Moral Reasoning Still Miss Half of the Picture

Aidan Kierans, Ritam Dutt, Kaley Rittichier, Shiri Dori-Hacohen, Avijit Ghosh


Abstract
Recent work on evaluating the moral competence of large language models (LLMs) has focused primarily on what we call the moral value problem, i.e., whether model outputs align with human moral values. In contrast, the moral norm problem, i.e., whether models can identify and correctly apply context-sensitive moral norms, remains underexplored. We posit that this imbalance stems from the field’s reliance on descriptive ethics frameworks, such as Moral Foundations Theory and Kohlberg’s stages of moral development, which emphasize value representation over normative application. We review existing benchmarks and evaluation methods, and show that they cluster heavily around the value problem, while discussion regarding normative ethics remains underrepresented. We identify three crucial gaps: (i) the absence of high-quality groundtruth data for moral norms and their applications, (ii) insufficient evaluation of intermediate reasoning processes, and (iii) limited attention to the identification of morally relevant features in context. Subsequently, we propose a research agenda that includes the development of standardized formal representations for normative theories, the construction of expert-annotated datasets capturing norm application, and evaluation protocols that explicitly distinguish between values-level and normslevel competence. Our goal is to encourage a more systematic study of normative reasoning in LLMs.
Anthology ID:
2026.evaleval-1.38
Volume:
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
Month:
July
Year:
2026
Address:
San Diego, CA
Editors:
Mubashara Akhtar, Jan Batzner, Leshem Choshen, Avijit Ghosh, Usman Gohar, Jennifer Mickel, Ichhya Pant, Zeerak Talat, Michelle Lin
Venues:
EvalEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
237–244
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.evaleval-1.38/
DOI:
Bibkey:
Cite (ACL):
Aidan Kierans, Ritam Dutt, Kaley Rittichier, Shiri Dori-Hacohen, and Avijit Ghosh. 2026. Position: Evaluations of AI Moral Reasoning Still Miss Half of the Picture. In Proceedings of the Workshop on Evaluating Evaluations (EvalEval), pages 237–244, San Diego, CA. Association for Computational Linguistics.
Cite (Informal):
Position: Evaluations of AI Moral Reasoning Still Miss Half of the Picture (Kierans et al., EvalEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.evaleval-1.38.pdf