Negative Sampling Techniques in Dense Retrieval: A Survey

Laurin Wischounig, Abdelrahman Abdallah, Adam Jatowt


Abstract
Information Retrieval (IR) is fundamental to many modern NLP applications. The rise of dense retrieval (DR), using neural networks to learn semantic vector representations, has significantly advanced IR performance. Central to training effective dense retrievers through contrastive learning is the selection of informative negative samples. Synthesizing 35 seminal papers, this survey provides a comprehensive and up-to-date overview of negative sampling techniques in dense IR. Our unique contribution is the focus on modern NLP applications and the inclusion of recent Large Language Model (LLM)-driven methods, an area absent in prior reviews. We propose a taxonomy that categorizes techniques, including random, static/dynamically mined, and synthetic datasets. We then analyze these approaches with respect to trade-offs between effectiveness, computational cost, and implementation difficulty. The survey concludes by outlining current challenges and promising future directions for the use of LLM-generated synthetic data.
Anthology ID:
2026.findings-eacl.157
Volume:
Findings of the Association for Computational Linguistics: EACL 2026
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3003–3020
Language:
URL:
https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.157/
DOI:
Bibkey:
Cite (ACL):
Laurin Wischounig, Abdelrahman Abdallah, and Adam Jatowt. 2026. Negative Sampling Techniques in Dense Retrieval: A Survey. In Findings of the Association for Computational Linguistics: EACL 2026, pages 3003–3020, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Negative Sampling Techniques in Dense Retrieval: A Survey (Wischounig et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.157.pdf
Checklist:
 2026.findings-eacl.157.checklist.pdf