Dynamic Reference Extraction and Linking across Multiple Scholarly Knowledge Graphs

Nicolau Duran-Silva, Pablo Accuosto


Abstract
References are an important feature of scientific literature; however, they are unstructured, heterogeneous, noisy, and often multilingual. We present a modular pipeline that leverages fine-tuned transformer models for reference location, classification, parsing, retrieval, and re-ranking across multiple scholarly knowledge graphs, with a focus on multilingual and non-traditional sources such as patents and policy documents. Our main contributions are: a unified pipeline for reference extraction and linking across diverse document types, openly released annotated datasets, fine-tuned models for each subtask, and evaluations across multiple scholarly knowledge graphs, enabling richer, more inclusive infrastructures for open research information.
Anthology ID:
2025.wasp-main.9
Volume:
Proceedings of the Third Workshop for Artificial Intelligence for Scientific Publications
Month:
December
Year:
2025
Address:
Mumbai, India and virtual
Editors:
Alberto Accomazzi, Tirthankar Ghosal, Felix Grezes, Kelly Lockhart
Venues:
WASP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
80–86
Language:
URL:
https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wasp-main.9/
DOI:
Bibkey:
Cite (ACL):
Nicolau Duran-Silva and Pablo Accuosto. 2025. Dynamic Reference Extraction and Linking across Multiple Scholarly Knowledge Graphs. In Proceedings of the Third Workshop for Artificial Intelligence for Scientific Publications, pages 80–86, Mumbai, India and virtual. Association for Computational Linguistics.
Cite (Informal):
Dynamic Reference Extraction and Linking across Multiple Scholarly Knowledge Graphs (Duran-Silva & Accuosto, WASP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wasp-main.9.pdf