Candidate-Aware Retrieval and Reranking for Multiple-Choice Question Answering: Arabic as a Case Study

Yassine Bouziane, Youness Moukafih, Mounir Ghogho


Abstract
Large language models (LLMs) have recently achieved impressive results on multiple-choice question answering (MCQA), with retrieval-augmented generation (RAG) emerging as an effective strategy for improving the performance of smaller models. However, existing RAG formulations face persistent challenges: retrieving too many passages often introduces noise, and even when relevant content is retrieved, models may still struggle with partially relevant or conflicting information. Moreover, while LLMs perform strongly on English benchmarks, their accuracy declines substantially on Arabic multi-task evaluations, revealing ongoing issues in cross-lingual transfer and domain adaptation. In this paper, we propose a novel approach, using Arabic as a representative case study, that jointly models the relevance of both the question and its candidate answers when selecting contextual passages. The method employs a lightweight reranker trained with a hybrid regression–triplet loss objective to identify passages that provide discriminative and reliable evidence. Extensive experiments across multiple model sizes and humanities domains show that our approach consistently outperforms both standard RAG baselines and reranker baselines, delivering two- to threefold improvements while remaining competitive with considerably larger models.
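The abstract names a hybrid regression–triplet loss but does not give its form. The following is a minimal sketch of one way such an objective could be combined, not the authors' implementation. It assumes a mean-squared-error regression term on relevance scores and a margin-based triplet term over the scores of a positive and a negative passage for the same question-plus-candidates query. The function names, the weight `alpha`, and `margin` are all hypothetical.

```python
from typing import Sequence


def regression_loss(scores: Sequence[float], targets: Sequence[float]) -> float:
    """Mean squared error between predicted and target relevance scores."""
    if len(scores) != len(targets) or not scores:
        raise ValueError("scores and targets must be non-empty and of equal length")
    return sum((s - t) ** 2 for s, t in zip(scores, targets)) / len(scores)


def triplet_loss(pos_score: float, neg_score: float, margin: float = 1.0) -> float:
    """Hinge penalty when the positive passage does not outscore the negative by `margin`."""
    return max(0.0, margin - pos_score + neg_score)


def hybrid_loss(
    pos_score: float,
    neg_score: float,
    pos_target: float,
    neg_target: float,
    alpha: float = 0.5,   # assumed mixing weight, not from the paper
    margin: float = 1.0,  # assumed margin, not from the paper
) -> float:
    """Weighted sum of the regression term and the triplet term."""
    reg = regression_loss([pos_score, neg_score], [pos_target, neg_target])
    tri = triplet_loss(pos_score, neg_score, margin)
    return alpha * reg + (1.0 - alpha) * tri


# The reranker scores each passage against the question and its candidate answers;
# here the scores are plain floats standing in for model outputs.
well_ranked = hybrid_loss(0.9, 0.2, pos_target=1.0, neg_target=0.0, margin=0.5)
mis_ranked = hybrid_loss(0.2, 0.9, pos_target=1.0, neg_target=0.0, margin=0.5)
```

In this sketch the regression term pulls each score toward its target, while the triplet term only contributes when the ranking margin is violated, so a mis-ranked pair incurs a larger loss than a well-ranked one.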
Anthology ID:
2026.findings-acl.435
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
8967–8977
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.435/
Cite (ACL):
Yassine Bouziane, Youness Moukafih, and Mounir Ghogho. 2026. Candidate-Aware Retrieval and Reranking for Multiple-Choice Question Answering: Arabic as a Case Study. In Findings of the Association for Computational Linguistics: ACL 2026, pages 8967–8977, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Candidate-Aware Retrieval and Reranking for Multiple-Choice Question Answering: Arabic as a Case Study (Bouziane et al., Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.435.pdf
Checklist:
 2026.findings-acl.435.checklist.pdf