Counter-Hypothesis Generation: Towards Evaluating How LLMs Reason about Alternatives
Marzieh Abdolmaleki, Aaron Maladry, Veronique Hoste, Els Lefever
Abstract
Reasoning about alternatives is a fundamental component of human cognition and argumentation, yet it remains unclear whether large language models (LLMs) can coherently generate and assess them. This paper introduces Counter-Hypothesis Generation (CHG), a novel task for evaluating how LLMs construct plausible hypotheses when contextual information changes. Inspired by open-domain commonsense reasoning, where models infer and compare multiple explanations, CHG bridges commonsense and counterfactual reasoning by requiring models to generate hypotheses that remain logically consistent with modified premises. We present a test set annotated by a human expert and complemented with counter-hypotheses generated by OpenAI-o3 and DeepSeek-r1. Experimental results reveal that even advanced reasoning models exhibit notable limitations in counter-hypothesis generation.- Anthology ID:
- 2026.lrec-main.424
- Volume:
- Proceedings of the Fifteenth Language Resources and Evaluation Conference
- Month:
- May
- Year:
- 2026
- Address:
- Palma de Mallorca, Spain
- Editors:
- Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
- Venue:
- LREC
- SIG:
- Publisher:
- ELRA Language Resource Association
- Note:
- Pages:
- 5445–5449
- Language:
- URL:
- https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.424/
- DOI:
- Cite (ACL):
- Marzieh Abdolmaleki, Aaron Maladry, Veronique Hoste, and Els Lefever. 2026. Counter-Hypothesis Generation: Towards Evaluating How LLMs Reason about Alternatives. International Conference on Language Resources and Evaluation, main:5445–5449.
- Cite (Informal):
- Counter-Hypothesis Generation: Towards Evaluating How LLMs Reason about Alternatives (Abdolmaleki et al., LREC 2026)
- PDF:
- https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.424.pdf