Automated Telescope-Paper Linkage via Multi-Model Ensemble Learning
Ojaswa Ojaswa Varshney, Prashasti Vyas, Priyanka Goyal, Tarpita Singh, Ritesh Kumar, Mayank Singh
Abstract
Automated linkage between scientific publications and telescope datasets is a cornerstone for scalable bibliometric analyses and ensuring scientific reproducibility in astrophysics. We propose a multi-model ensemble architecture integrating transformer models DeBERTa, RoBERTa, and TF-IDF logistic regression, tailored to the WASP-2025 shared task on telescope-paper classification. Our approach achieves a macro F1 score approaching 0.78 after extensive multi-seed ensembling and per-label threshold tuning, significantly outperforming baseline models. This paper presents comprehensive methodology, ablation studies, and an in-depth discussion of challenges, establishing a robust benchmark for scientific bibliometric task automation.- Anthology ID:
- 2025.wasp-main.15
- Volume:
- Proceedings of the Third Workshop for Artificial Intelligence for Scientific Publications
- Month:
- December
- Year:
- 2025
- Address:
- Mumbai, India and virtual
- Editors:
- Alberto Accomazzi, Tirthankar Ghosal, Felix Grezes, Kelly Lockhart
- Venues:
- WASP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 127–135
- Language:
- URL:
- https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wasp-main.15/
- DOI:
- Cite (ACL):
- Ojaswa Ojaswa Varshney, Prashasti Vyas, Priyanka Goyal, Tarpita Singh, Ritesh Kumar, and Mayank Singh. 2025. Automated Telescope-Paper Linkage via Multi-Model Ensemble Learning. In Proceedings of the Third Workshop for Artificial Intelligence for Scientific Publications, pages 127–135, Mumbai, India and virtual. Association for Computational Linguistics.
- Cite (Informal):
- Automated Telescope-Paper Linkage via Multi-Model Ensemble Learning (Ojaswa Varshney et al., WASP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wasp-main.15.pdf