@inproceedings{wu-deng-2025-counts,
    title = "What Counts Underlying {LLM}s' Moral Dilemma Judgments?",
    author = "Wu, Wenya  and
      Deng, Weihong",
    editor = "Atwell, Katherine  and
      Biester, Laura  and
      Borah, Angana  and
      Dementieva, Daryna  and
      Ignat, Oana  and
      Kotonya, Neema  and
      Liu, Ziyi  and
      Wan, Ruyuan  and
      Wilson, Steven  and
      Zhao, Jieyu",
    booktitle = "Proceedings of the Fourth Workshop on NLP for Positive Impact (NLP4PI)",
    month = jul,
    year = "2025",
    address = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2025.nlp4pi-1.12/",
    doi = "10.18653/v1/2025.nlp4pi-1.12",
    pages = "144--150",
    ISBN = "978-1-959429-19-7",
    abstract = "Moral judgments in LLMs increasingly capture the attention of researchers in AI ethics domain. This study explores moral judgments of three open-source large language models (LLMs){---}Qwen-1.5-14B, Llama3-8B, and DeepSeek-R1 in plausible moral dilemmas, examining their sensitivity to social exposure and collaborative decision-making. Using a dual-process framework grounded in deontology and utilitarianism, we evaluate LLMs' responses to moral dilemmas under varying social contexts. Results reveal that all models are significantly influenced by moral norms rather than consequences, with DeepSeek-R1 exhibiting a stronger action tendency compared to Qwen-1.5-14B and Llama3-8B, which show higher inaction preferences. Social exposure and collaboration impact LLMs differently: Qwen-1.5-14B becomes less aligned with moral norms under observation, while DeepSeek-R1{'}s action tendency is moderated by social collaboration. These findings highlight the nuanced moral reasoning capabilities of LLMs and their varying sensitivity to social cues, providing insights into the ethical alignment of AI systems in socially embedded contexts."
}Markdown (Informal)
[What Counts Underlying LLMs’ Moral Dilemma Judgments?](https://preview.aclanthology.org/ingest-emnlp/2025.nlp4pi-1.12/) (Wu & Deng, NLP4PI 2025)
ACL