Masking or Mitigating? Deconstructing the Impact of Query Rewriting on Retriever Biases in RAG

Agam Goyal, Koyel Mukherjee, Apoorv Saxena, Anirudh Phukan, Eshwar Chandrasekharan, Hari Sundaram


Abstract
Dense retrievers in retrieval-augmented generation (RAG) systems exhibit systematic biases—including brevity, position, literal matching, and repetition biases—that can compromise retrieval quality. Query rewriting techniques are now standard in RAG pipelines, yet their impact on these biases remains unexplored. We present the first systematic study of how query enhancement techniques affect dense retrieval biases, evaluating five methods across six retrievers. Our findings reveal that simple LLM-based rewriting achieves the strongest aggregate bias reduction (54%), yet fails under adversarial conditions where multiple biases combine. Mechanistic analysis uncovers two distinct mechanisms: simple rewriting reduces bias through increased score variance, while pseudo-document methods achieve reduction through genuine decorrelation from bias-inducing features. However, no technique uniformly addresses all biases, and effects vary substantially across retrievers. Our results provide practical guidance for selecting query enhancement strategies based on specific bias vulnerabilities. More broadly, we establish a taxonomy distinguishing query-document interaction biases from document encoding biases, clarifying the limits of query-side interventions for debiasing RAG systems.
Anthology ID:
2026.findings-acl.414
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8517–8530
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.414/
DOI:
Bibkey:
Cite (ACL):
Agam Goyal, Koyel Mukherjee, Apoorv Saxena, Anirudh Phukan, Eshwar Chandrasekharan, and Hari Sundaram. 2026. Masking or Mitigating? Deconstructing the Impact of Query Rewriting on Retriever Biases in RAG. In Findings of the Association for Computational Linguistics: ACL 2026, pages 8517–8530, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Masking or Mitigating? Deconstructing the Impact of Query Rewriting on Retriever Biases in RAG (Goyal et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.414.pdf
Checklist:
 2026.findings-acl.414.checklist.pdf