Agentic Oversight via Dialectic Reasoning

Leonardo Ranaldi; Federico Ranaldi

Agentic Oversight via Dialectic Reasoning

Abstract

Debate has emerged as a promising oversight mechanism for Large Language Models (LLMs) amid rising systemic complexity, particularly where models outperform human evaluators. Yet, Debate provides little verifiable evidence for its final judgments, and its scalability remains largely unexplored. To make oversight grounded and scale as capabilities extend, we introduce an Agentic Oversight framework. By using Dialectic Argumentation as a reasoning function, we extend this paradigm to multilingual and multimodal spaces. We employ a weak-to-strong oversight approach based on two expert models that evaluate and defend contesting answers, while a third blind judge determines the winner using Dialectic Argumentation. Experts argue only for belief-consistent answers, founding the Debate on disagreements. We experimented with six tasks on our framework in both multilingual and multimodal scenarios, and dialectic argumentation consistently outperforms single-expert baselines. Moreover, we show that dialectic judgements from a weaker model deliver argument-mediated supervision that, via fine-tuning, instils unsupervised reasoning signals in expert models.

Anthology ID:: 2026.acl-long.1143
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 24924–24944
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1143/
DOI:
Bibkey:
Cite (ACL):: Leonardo Ranaldi and Federico Ranaldi. 2026. Agentic Oversight via Dialectic Reasoning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 24924–24944, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Agentic Oversight via Dialectic Reasoning (Ranaldi & Ranaldi, ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1143.pdf
Checklist:: 2026.acl-long.1143.checklist.pdf

PDF Cite Search Checklist Fix data