Document Attribution: Examining Citation Relationships using Large Language Models

Vipula Rawte, Ryan A. Rossi, Franck Dernoncourt, Nedim Lipka


Abstract
As Large Language Models (LLMs) are increasingly applied to document-based tasks - such as document summarization, question answering, and information extraction - where user requirements focus on retrieving information from provided documents rather than relying on the model’s parametric knowledge, ensuring the trustworthiness and interpretability of these systems has become a critical concern. A central approach to addressing this challenge is attribution, which involves tracing the generated outputs back to their source documents. However, since LLMs can produce inaccurate or imprecise responses, it is crucial to assess the reliability of these citations.To tackle this, our work proposes two techniques. (1) A zero-shot approach that frames attribution as a straightforward textual entailment task. Our method using flan-ul2 demonstrates an improvement of 0.27% and 2.4% over the best baseline of ID and OOD sets of AttributionBench (CITATION), respectively. (2) We also explore the role of the attention mechanism in enhancing the attribution process. Using a smaller LLM, flan-t5-small, the F1 scores outperform the baseline across almost all layers except layer 4 and layers 8 through 11.
Anthology ID:
2025.sdp-1.12
Volume:
Proceedings of the Fifth Workshop on Scholarly Document Processing (SDP 2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Tirthankar Ghosal, Philipp Mayr, Amanpreet Singh, Aakanksha Naik, Georg Rehm, Dayne Freitag, Dan Li, Sonja Schimmler, Anita De Waard
Venues:
sdp | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
132–136
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.sdp-1.12/
DOI:
Bibkey:
Cite (ACL):
Vipula Rawte, Ryan A. Rossi, Franck Dernoncourt, and Nedim Lipka. 2025. Document Attribution: Examining Citation Relationships using Large Language Models. In Proceedings of the Fifth Workshop on Scholarly Document Processing (SDP 2025), pages 132–136, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Document Attribution: Examining Citation Relationships using Large Language Models (Rawte et al., sdp 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.sdp-1.12.pdf