Brian Jin
2026
Using a Human-AI Teaming Approach to Create and Curate Scientific Datasets with the SciLire System
Necva Bölücü | Jessica Irons | Changhyun Lee | Brian Jin | Maciej Rybinski | Huichen Yang | Andreas Duenser | Stephen Wan
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Necva Bölücü | Jessica Irons | Changhyun Lee | Brian Jin | Maciej Rybinski | Huichen Yang | Andreas Duenser | Stephen Wan
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations)
The rapid growth of scientific literature has made manual extraction of structured knowledge increasingly impractical. To address this challenge, we introduce SCILIRE, a system for creating datasets from scientific literature. SCILIRE has been designed around Human-AI teaming principles centred on workflows for verifying and curating data. It facilitates an iterative workflow in which researchers can review and correct AI outputs. Furthermore, this interaction is used as a feedback signal to improve future LLM-based inference. We evaluate our design using a combination of intrinsic benchmarking outcomes together with real-world case studies across multiple domains. The results demonstrate that SCILIRE improves extraction fidelity and facilitates efficient dataset creation.
2016
CSIRO Data61 at the WNUT Geo Shared Task
Gaya Jayasinghe | Brian Jin | James Mchugh | Bella Robinson | Stephen Wan
Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
Gaya Jayasinghe | Brian Jin | James Mchugh | Bella Robinson | Stephen Wan
Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
In this paper, we describe CSIRO Data61’s participation in the Geolocation shared task at the Workshop for Noisy User-generated Text. Our approach was to use ensemble methods to capitalise on four component methods: heuristics based on metadata, a label propagation method, timezone text classifiers, and an information retrieval approach. The ensembles we explored focused on examining the role of language technologies in geolocation prediction and also in examining the use of hard voting and cascading ensemble methods. Based on the accuracy of city-level predictions, our systems were the best performing submissions at this year’s shared task. Furthermore, when estimating the latitude and longitude of a user, our median error distance was accurate to within 30 kilometers.