A Three-Level Audit of LLM Alignment for Argument Quality Assessment

Wei-Fan Chen, Jinming Yu, Lucie Flek


Abstract
Large Language Models (LLMs) are increasingly used as automated evaluators of argument quality. However, existing studies typically assess models only through their agreement with human scores, leaving the reasoning process behind these judgments unexplored. In this paper, we propose a three-level audit framework for evaluating the reliability of LLM-based argument quality assessment. The framework distinguishes between (1) surface alignment, measuring agreement between LLM-predicted scores and human annotations; (2) instructional alignment, assessing whether generated rationales adhere to the intended evaluation criteria; and (3) faithfulness alignment, examining whether predicted scores are supported by the generated rationales. To operationalize this audit, we introduce structural rationale prompting, which guides LLMs to generate structured justifications before assigning scores across 11 dimensions of the Dagstuhl-15512 argument quality corpus. We evaluate several LLMs under this framework and find that structural rationale prompting substantially improves agreement with human annotations compared to definition-based prompting. Furthermore, the generated rationales generally follow the evaluation instructions and remain highly consistent with the predicted scores. Overall, our results suggest that auditing LLM evaluators beyond surface score agreement provides deeper insight into the reliability and transparency of LLM-based evaluation.
Anthology ID:
2026.argmining-1.3
Volume:
Proceedings of the 13th Workshop on Argument Mining and Reasoning
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Mohamed Elaraby, Annette Hautli-Janisz, Julia Romberg, Elena Musi, Federico Ruggeri, John Lawrence
Venues:
ArgMining | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
19–31
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.argmining-1.3/
DOI:
Bibkey:
Cite (ACL):
Wei-Fan Chen, Jinming Yu, and Lucie Flek. 2026. A Three-Level Audit of LLM Alignment for Argument Quality Assessment. In Proceedings of the 13th Workshop on Argument Mining and Reasoning, pages 19–31, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
A Three-Level Audit of LLM Alignment for Argument Quality Assessment (Chen et al., ArgMining 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.argmining-1.3.pdf