Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers

Nishant Balepur; Atrey Desai; Rachel Rudinger

Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers

Nishant Balepur, Atrey Desai, Rachel Rudinger

Abstract

Large language models (LLMs) now give reasoning before answering, excelling in tasks like multiple-choice question answering (MCQA). Yet, a concern is that LLMs do not solve MCQs as intended, as work finds LLMs sans reasoning succeed in MCQA without using the question, i.e., choices-only. Such partial-input success is often deemed problematic, but reasoning traces could reveal if these strategies are truly shallow in choices-only settings. To study these strategies, reasoning LLMs solve MCQs in full and choices-only inputs; test-time reasoning often boosts accuracy on full and in choices-only half the time. While possibly due to shallow shortcuts, choices-only success is barely affected by the length of reasoning traces, and after finding traces pass faithfulness tests, we show they use less problematic strategies like inferring missing questions. In all, we challenge claims that partial-input success is always a flaw, so we discuss how reasoning traces could separate problematic data from less problematic reasoning.

Anthology ID:: 2026.acl-short.23
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 250–272
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-short.23/
DOI:
Bibkey:
Cite (ACL):: Nishant Balepur, Atrey Desai, and Rachel Rudinger. 2026. Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 250–272, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers (Balepur et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-short.23.pdf
Checklist:: 2026.acl-short.23.checklist.pdf

PDF Cite Search Checklist Fix data