GUMBridge: A Corpus for Varieties of Bridging Anaphora

Lauren Levine, Amir Zeldes


Abstract
Bridging is an anaphoric phenomenon where the referent of an entity in a discourse is dependent on a previous, non-identical entity for interpretation, such as in "There is a house. The door is red," where the door is specifically understood to be the door of the aforementioned house. While there are several existing resources in English for bridging anaphora, most are small, provide limited coverage of the phenomenon, and/or provide limited genre coverage. In this paper, we introduce GUMBridge, a new resource for bridging, which includes 24 diverse genres of English, providing both broad coverage for the phenomenon, and granular annotations for the multi-subtype categorization of bridging varieties. We also present an evaluation of annotation quality and report on baseline performance using open and closed source contemporary LLMs on three tasks underlying our data, showing that bridging resolution and subtype classification remain difficult NLP tasks in the age of LLMs.
Anthology ID:
2026.lrec-main.543
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
6823–6837
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.543/
DOI:
Bibkey:
Cite (ACL):
Lauren Levine and Amir Zeldes. 2026. GUMBridge: A Corpus for Varieties of Bridging Anaphora. International Conference on Language Resources and Evaluation, main:6823–6837.
Cite (Informal):
GUMBridge: A Corpus for Varieties of Bridging Anaphora (Levine & Zeldes, LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.543.pdf
Optionalsupplementarymaterial:
 2026.lrec-main.543.OptionalSupplementaryMaterial.zip