Optimizing RAG: Classifying Queries for Dynamic Processing

Kabir Olawore, Michael McTear, Yaxin Bi, David Griol


Abstract
In Retrieval-Augmented Generation (RAG) systems efficient information retrieval is crucial for enhancing user experience and satisfaction, as response times and computational demands significantly impact performance. RAG can be unnecessarily resource-intensive for frequently asked questions (FAQs) and simple questions. In this paper we introduce an approach in which we categorize user questions into simple queries that do not require RAG processing. Evaluation results show that our proposal reduces latency and improves response efficiency compared to systems relying solely on RAG.
Anthology ID:
2025.iwsds-1.14
Volume:
Proceedings of the 15th International Workshop on Spoken Dialogue Systems Technology
Month:
May
Year:
2025
Address:
Bilbao, Spain
Editors:
Maria Ines Torres, Yuki Matsuda, Zoraida Callejas, Arantza del Pozo, Luis Fernando D'Haro
Venues:
IWSDS | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
160–164
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.iwsds-1.14/
DOI:
Bibkey:
Cite (ACL):
Kabir Olawore, Michael McTear, Yaxin Bi, and David Griol. 2025. Optimizing RAG: Classifying Queries for Dynamic Processing. In Proceedings of the 15th International Workshop on Spoken Dialogue Systems Technology, pages 160–164, Bilbao, Spain. Association for Computational Linguistics.
Cite (Informal):
Optimizing RAG: Classifying Queries for Dynamic Processing (Olawore et al., IWSDS 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.iwsds-1.14.pdf