Re2-DocRED: Revisiting Revisited-DocRED for Joint Entity and Relation Extraction

Chen Kim Heng; Shao Wen Tong; Julian Wong Wei Sheng

Re2-DocRED: Revisiting Revisited-DocRED for Joint Entity and Relation Extraction

Chen Kim Heng, Shao Wen Tong, Julian Wong Wei Sheng

Abstract

Document-level Joint Entity and Relation Extraction (JERE) benchmarks such as DocRED, Re-DocRED, and DocGNRE suffer from pervasive False Negatives (FN), undermining training and evaluation. In this paper, we introduce SiftingLogic – a training-free annotation pipeline that leverages LLMs with user-specifiable reasoning, enriched inverse/co-occurring relation schemas, and novel entity-level constraints to systematically address FN gaps. Applying SiftingLogic and our enriched schema of inverse and co-occurring relations, we add 29,580 verified triplets to Re-DocRED (train/dev, +27%) and over 9,700 verified triplets to DocGNRE test (+49.89%), yielding the enhanced Re²-DocRED dataset. Beyond English datasets, we also apply our SiftingLogic to REDFM Mandarin test set, resulting in a significant increase in triplets from 663 to 1,391 (+109.8%) demonstrating our pipeline’s generalisability across languages and datasets. Our experiments show that recall scores of models trained on existing public datasets drop notably on our revised splits, whereas our enriched training set mitigates this, underscoring persistent FN gaps and motivating our proposed SiftingLogic and Re²-DocRED. To facilitate further research and reproducibility of our work, the Re²-DocRED dataset is released at https://github.com/klassessg/re2-docred.

Anthology ID:: 2026.eacl-long.213
Volume:: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4585–4621
Language:
URL:: https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.213/
DOI:
Bibkey:
Cite (ACL):: Chen Kim Heng, Shao Wen Tong, and Julian Wong Wei Sheng. 2026. Re2-DocRED: Revisiting Revisited-DocRED for Joint Entity and Relation Extraction. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4585–4621, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Re2-DocRED: Revisiting Revisited-DocRED for Joint Entity and Relation Extraction (Heng et al., EACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.213.pdf

PDF Cite Search Fix data