Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language Models

Bumjin Park; Leejinsil Leejinsil; Jaesik Choi

Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language Models

Bumjin Park, Leejinsil Leejinsil, Jaesik Choi

Abstract

Large language models (LLMs) are increasingly engaging in moral and ethical reasoning, where criteria for judgment are often unclear, even for humans. While LLM alignment studies cover many areas, one important yet underexplored area is how LLMs make judgments about obligations. This work reveals a strong tendency in LLMs to judge non-obligatory contexts as obligations when prompts are augmented with modal expressions such as must or ought to. We introduce this phenomenon as Deontological Keyword Bias (DKB). We find that LLMs judge over 90% of commonsense scenarios as obligations when modal expressions are present. This tendency is consist across various LLM families, question types, and answer formats. To mitigate DKB, we propose a judgment strategy that integrates few-shot examples with reasoning prompts. This study sheds light on how modal expressions, as a form of linguistic framing, influence the normative decisions of LLMs and underscores the importance of addressing such biases to ensure judgment alignment.

Anthology ID:: 2025.acl-long.360
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 7277–7296
Language:
URL:: https://preview.aclanthology.org/landing_page/2025.acl-long.360/
DOI:
Bibkey:
Cite (ACL):: Bumjin Park, Leejinsil Leejinsil, and Jaesik Choi. 2025. Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language Models. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7277–7296, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language Models (Park et al., ACL 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/landing_page/2025.acl-long.360.pdf

PDF Cite Search Fix data