Using a Human-AI Teaming Approach to Create and Curate Scientific Datasets with the SciLire System
Necva Bölücü, Jessica Irons, Changhyun Lee, Brian Jin, Maciej Rybinski, Huichen Yang, Andreas Duenser, Stephen Wan
Abstract
The rapid growth of scientific literature has made manual extraction of structured knowledge increasingly impractical. To address this challenge, we introduce SCILIRE, a system for creating datasets from scientific literature. SCILIRE has been designed around Human-AI teaming principles centred on workflows for verifying and curating data. It facilitates an iterative workflow in which researchers can review and correct AI outputs. Furthermore, this interaction is used as a feedback signal to improve future LLM-based inference. We evaluate our design using a combination of intrinsic benchmarking outcomes together with real-world case studies across multiple domains. The results demonstrate that SCILIRE improves extraction fidelity and facilitates efficient dataset creation.- Anthology ID:
- 2026.eacl-demo.30
- Volume:
- Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations)
- Month:
- March
- Year:
- 2026
- Address:
- Rabat, Marocco
- Editors:
- Danilo Croce, Jochen Leidner, Nafise Sadat Moosavi
- Venue:
- EACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 428–444
- Language:
- URL:
- https://preview.aclanthology.org/ingest-eacl/2026.eacl-demo.30/
- DOI:
- Cite (ACL):
- Necva Bölücü, Jessica Irons, Changhyun Lee, Brian Jin, Maciej Rybinski, Huichen Yang, Andreas Duenser, and Stephen Wan. 2026. Using a Human-AI Teaming Approach to Create and Curate Scientific Datasets with the SciLire System. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 428–444, Rabat, Marocco. Association for Computational Linguistics.
- Cite (Informal):
- Using a Human-AI Teaming Approach to Create and Curate Scientific Datasets with the SciLire System (Bölücü et al., EACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-eacl/2026.eacl-demo.30.pdf