Can Large Language Models Unlock Novel Scientific Research Ideas?

Sandeep Kumar; Tirthankar Ghosal; Vinayak Goyal; Asif Ekbal

Can Large Language Models Unlock Novel Scientific Research Ideas?

Sandeep Kumar, Tirthankar Ghosal, Vinayak Goyal, Asif Ekbal

Abstract

The widespread adoption of Large Language Models (LLMs) and publicly available ChatGPT have marked a significant turning point in the integration of Artificial Intelligence (AI) into people’s everyday lives. This study explores the capability of LLMs in generating novel research ideas based on information from research papers. We conduct a thorough examination of 4 LLMs in five domains (e.g., Chemistry, Computer, Economics, Medical, and Physics). We found that the future research ideas generated by Claude-2 and GPT-4 are more aligned with the author’s perspective than GPT-3.5 and Gemini. We also found that Claude-2 generates more diverse future research ideas than GPT-4, GPT-3.5, and Gemini 1.0. We further performed a human evaluation of the novelty, relevancy, and feasibility of the generated future research ideas. This investigation offers insights into the evolving role of LLMs in idea generation, highlighting both its capability and limitations. Our work contributes to the ongoing efforts in evaluating and utilizing language models for generating future research ideas. We make our datasets and codes publicly available.

Anthology ID:: 2025.emnlp-main.1704
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 33551–33575
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1704/
DOI:
Bibkey:
Cite (ACL):: Sandeep Kumar, Tirthankar Ghosal, Vinayak Goyal, and Asif Ekbal. 2025. Can Large Language Models Unlock Novel Scientific Research Ideas?. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 33551–33575, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Can Large Language Models Unlock Novel Scientific Research Ideas? (Kumar et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1704.pdf
Checklist:: 2025.emnlp-main.1704.checklist.pdf

PDF Cite Search Checklist Fix data