Latifah Kamarudin
2025
TeG-DRec: Inductive Text-Graph Learning for Unseen Node Scientific Dataset Recommendation
Ammar Qayyum
|
Bassamtiano Irnawan
|
Fumiyo Fukumoto
|
Latifah Kamarudin
|
Kentaro Go
|
Yoshimi Suzuki
Proceedings of the Third Workshop for Artificial Intelligence for Scientific Publications
Scientific datasets are crucial for evaluating scientific research, and their number is increasing rapidly. Most scientific dataset recommendation systems use Information Retrieval (IR) methods that model semantics while overlooking interactions. Graph Neural Networks (GNNs) excel at handling interactions between entities but often overlook textual content, limiting their ability to generalise to unseen nodes. We propose TeG-DRec, a framework for scientific dataset recommendation that integrates GNNs and textual content via a subgraph generation module to ensure correct propagation throughout the model, enabling handling of unseen data. Experimental results on the dataset recommendation’s dataset show that our method outperformed the baselines for text-based IR and graph-based recommendation systems. Our source code is available at https://github.com/Maqif14/TeG-DRec.git