Word2Passage: Word-level Importance Re-weighting for Query Expansion

Jeonghwan Choi, Minjeong Ban, Minseok Kim, Hwanjun Song


Abstract
Retrieval-augmented generation (RAG) enhances the quality of LLM generation by providing relevant chunks, but retrieving accurately from external knowledge remains challenging due to missing contextually important words in query. We present Word2Passage, a novel approach that improves retrieval accuracy by optimizing word importance in query expansion. Our method generates references at word, sentence, and passage levels for query expansion, then determines word importance by considering both their reference level origin and characteristics derived from query types and corpus analysis. Specifically, our method assigns distinct importance scores to words based on whether they originate from word, sentence, or passage-level references. Extensive experiments demonstrate that Word2Passage outperforms existing methods across various datasets and LLM configurations, effectively enhancing both retrieval accuracy and generation quality. The code is publicly available at https://github.com/DISL-Lab/Word2Passage
Anthology ID:
2025.findings-acl.434
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8276–8296
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.findings-acl.434/
DOI:
Bibkey:
Cite (ACL):
Jeonghwan Choi, Minjeong Ban, Minseok Kim, and Hwanjun Song. 2025. Word2Passage: Word-level Importance Re-weighting for Query Expansion. In Findings of the Association for Computational Linguistics: ACL 2025, pages 8276–8296, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Word2Passage: Word-level Importance Re-weighting for Query Expansion (Choi et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.findings-acl.434.pdf