CiteLab: Developing and Diagnosing LLM Citation Generation Workflows via the Human-LLM Interaction

Jiajun Shen, Tong Zhou, Yubo Chen, Kang Liu, Jun Zhao


Abstract
The emerging paradigm of enabling Large Language Models (LLMs) to generate citations in Question-Answering (QA) tasks is lacking in a unified framework to standardize and fairly compare different citation generation methods, leading to difficulties in reproduction and innovation. Therefore, we introduce Citeflow, an open-source and modular framework fostering reproduction and the implementation of new designs. Citeflow is highly extensible, allowing users to utilize four main modules and 14 components to construct a pipeline, evaluate an existing method, and understand the attributing LLM-generated contents. The framework is also paired with a visual interface, Citefix, facilitating case study and modification of existing citation generation methods. Users can use this interface to conduct LLM-powered case studies according to different scenarios. Citeflow and Citefix are highly integrated into the toolkit CiteLab, and we use an authentic process of multiple rounds of improvement through the Human-LLM interaction interface to demonstrate the efficiency of our toolkit on implementing and modifying citation generation pipelines. Citelab is released at https://github.com/SjJ1017/Citelab
Anthology ID:
2025.acl-demo.47
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Pushkar Mishra, Smaranda Muresan, Tao Yu
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
490–501
Language:
URL:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-demo.47/
DOI:
Bibkey:
Cite (ACL):
Jiajun Shen, Tong Zhou, Yubo Chen, Kang Liu, and Jun Zhao. 2025. CiteLab: Developing and Diagnosing LLM Citation Generation Workflows via the Human-LLM Interaction. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 490–501, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
CiteLab: Developing and Diagnosing LLM Citation Generation Workflows via the Human-LLM Interaction (Shen et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-demo.47.pdf
Copyright agreement:
 2025.acl-demo.47.copyright_agreement.pdf