CuriosAI at SemEval-2026 Task 8: Hybrid retrieval system with repeated sampling for generation

Aiswariya Manoj Kumar, Hiroki Takushima, Fumika Beppu, Yuki Shibata, Daichi Yamaga, Takayuki Hori


Abstract
SemEval-2026 Task 8 (MTRAGEval) evaluates multi-turn Retrieval-Augmented Generation (RAG) under conversational challenges such as non-standalone turns, underspecification, and answerability detection. These conditions amplify retrieval and generation errors that standard single-turn RAG pipelines fail to address effectively. We present a robustness-oriented multi-turn RAG system combining contextual query rewriting, heterogeneous hybrid retrieval fused with Reciprocal Rank Fusion (RRF), domain-adaptive Low-Rank Adaptation (LoRA) reranking, and repeated sampling with metric-guided selection. On the official test set, our approach outperforms the organizers’ baselines across all subtasks: Retrieval (nDCG@5: 0.5396 vs. 0.4795), Generation (0.7571 vs. 0.6390), and RAG (0.5486 vs. 0.5366). Our system ranks 5th in Subtask A, 5th in Subtask B, and 7th in Subtask C on the official leaderboard. These results demonstrate that calibrated hybrid retrieval combined with robust generation selection is effective for multi-turn RAG.
Anthology ID:
2026.semeval-1.151
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1106–1115
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.151/
DOI:
Bibkey:
Cite (ACL):
Aiswariya Manoj Kumar, Hiroki Takushima, Fumika Beppu, Yuki Shibata, Daichi Yamaga, and Takayuki Hori. 2026. CuriosAI at SemEval-2026 Task 8: Hybrid retrieval system with repeated sampling for generation. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 1106–1115, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
CuriosAI at SemEval-2026 Task 8: Hybrid retrieval system with repeated sampling for generation (Manoj Kumar et al., SemEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.151.pdf
Supplementarymaterial:
 2026.semeval-1.151.SupplementaryMaterial.tex