Hype or not? Formalizing Automatic Promotional Language Detection in Biomedical Research

Bojan Batalo, Erica K. Shimomoto, Dipesh Satav, Neil Millar


Abstract
In science, promotional language (‘hype’) is increasing and can undermine objective evaluation of evidence, impede research development, and erode trust in science. In this paper, we introduce the task of automatic detection of hype, which we define as hyperbolic or subjective language that authors use to glamorize, promote, embellish, or exaggerate aspects of their research. We propose formalized guidelines for identifying hype language and apply them to annotate a portion of the National Institutes of Health (NIH) grant application corpus. We then evaluate traditional text classifiers and language models on this task, comparing their performance with a human baseline. Our experiments show that formalizing annotation guidelines can help humans reliably annotate candidate hype adjectives and that using our annotated dataset to train machine learning models yields promising results. Our findings highlight the linguistic complexity of the task and the potential need for domain knowledge. While some linguistic works address hype detection, to the best of our knowledge, we are the first to approach it as a natural language processing task. Our annotation guidelines and dataset are available at https://github.com/hype-busters/eacl2026-hype-dataset.
Anthology ID:
2026.eacl-long.328
Volume:
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:
EACL
Publisher:
Association for Computational Linguistics
Pages:
6979–6992
URL:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.328/
Cite (ACL):
Bojan Batalo, Erica K. Shimomoto, Dipesh Satav, and Neil Millar. 2026. Hype or not? Formalizing Automatic Promotional Language Detection in Biomedical Research. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6979–6992, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Hype or not? Formalizing Automatic Promotional Language Detection in Biomedical Research (Batalo et al., EACL 2026)
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.328.pdf