Counter-Hypothesis Generation: Towards Evaluating How LLMs Reason about Alternatives

Marzieh Abdolmaleki; Aaron Maladry; Veronique Hoste; Els Lefever

Counter-Hypothesis Generation: Towards Evaluating How LLMs Reason about Alternatives

Marzieh Abdolmaleki, Aaron Maladry, Veronique Hoste, Els Lefever

Abstract

Reasoning about alternatives is a fundamental component of human cognition and argumentation, yet it remains unclear whether large language models (LLMs) can coherently generate and assess them. This paper introduces Counter-Hypothesis Generation (CHG), a novel task for evaluating how LLMs construct plausible hypotheses when contextual information changes. Inspired by open-domain commonsense reasoning, where models infer and compare multiple explanations, CHG bridges commonsense and counterfactual reasoning by requiring models to generate hypotheses that remain logically consistent with modified premises. We present a test set annotated by a human expert and complemented with counter-hypotheses generated by OpenAI-o3 and DeepSeek-r1. Experimental results reveal that even advanced reasoning models exhibit notable limitations in counter-hypothesis generation.

Anthology ID:: 2026.lrec-main.424
Volume:: Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:: May
Year:: 2026
Address:: Palma de Mallorca, Spain
Editors:: Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:: LREC
SIG:
Publisher:: ELRA Language Resource Association
Note:
Pages:: 5445–5449
Language:
URL:: https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.424/
DOI:
Bibkey:
Cite (ACL):: Marzieh Abdolmaleki, Aaron Maladry, Veronique Hoste, and Els Lefever. 2026. Counter-Hypothesis Generation: Towards Evaluating How LLMs Reason about Alternatives. International Conference on Language Resources and Evaluation, main:5445–5449.
Cite (Informal):: Counter-Hypothesis Generation: Towards Evaluating How LLMs Reason about Alternatives (Abdolmaleki et al., LREC 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.424.pdf

PDF Cite Search Fix data