Transitioning from benchmarks to a real-world case of information-seeking in Scientific Publications
Chyrine Tahri, Aurore Bochnakian, Patrick Haouat, Xavier Tannier
Abstract
Although recent years have been marked by incredible advances in the whole development process of NLP systems, there are still blind spots in characterizing what is still hampering real-world adoption of models in knowledge-intensive settings. In this paper, we illustrate through a real-world zero-shot text search case for information seeking in scientific papers, the masked phenomena that the current process of measuring performance might not reflect, even when benchmarks are, in appearance, faithfully representative of the task at hand. In addition to experimenting with TREC-COVID and NFCorpus, we provide an industrial, expert-carried/annotated, case of studying vitamin B’s impact on health. We thus discuss the misalignment between solely focusing on single-metric performance as a criterion for model choice and relevancy as a subjective measure for meeting a user’s need.
- Anthology ID:
- 2023.findings-acl.68
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2023
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 1066–1076
- URL:
- https://aclanthology.org/2023.findings-acl.68
- DOI:
- 10.18653/v1/2023.findings-acl.68
- Cite (ACL):
- Chyrine Tahri, Aurore Bochnakian, Patrick Haouat, and Xavier Tannier. 2023. Transitioning from benchmarks to a real-world case of information-seeking in Scientific Publications. In Findings of the Association for Computational Linguistics: ACL 2023, pages 1066–1076, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Transitioning from benchmarks to a real-world case of information-seeking in Scientific Publications (Tahri et al., Findings 2023)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/2023.findings-acl.68.pdf