Shuai Wang
Other people with similar names: Shuai Wang , Shuai Wang , Shuai Wang
2024
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
David Rau
|
Hervé Déjean
|
Nadezhda Chirkova
|
Thibault Formal
|
Shuai Wang
|
Stéphane Clinchant
|
Vassilina Nikoulina
Findings of the Association for Computational Linguistics: EMNLP 2024
Retrieval-Augmented Generation allows to enhance Large Language Models with external knowledge. In response to the recent popularity of generative LLMs, many RAG approaches have been proposed, which involve an intricate number of different configurations such as evaluation datasets, collections, metrics, retrievers, and LLMs. Inconsistent benchmarking poses a major challenge in comparing approaches and understanding the impact of each component in the pipeline. In this work, we study best practices that lay the groundwork for a systematic evaluation of RAG and present BERGEN, an end-to-end library for reproducible research standardizing RAG experiments. In an extensive study focusing on QA, we benchmark different state-of-the-art retrievers, rerankers, and LLMs. Additionally, we analyze existing RAG metrics and datasets.
Search
Fix author
Co-authors
- Nadezhda Chirkova 1
- Stéphane Clinchant 1
- Hervé Déjean 1
- Thibault Formal 1
- Vassilina Nikoulina 1
- show all...