Zero-Shot Keyphrase Generation: Investigating Specialized Instructions and Multi-sample Aggregation on Large Language Models

Jishnu Ray Chowdhury, Jayanth Mohan, Tomas Malik, Cornelia Caragea


Abstract
Keyphrases are the essential topical phrases that summarize a document. Keyphrase generation is a long-standing NLP task for automatically generating keyphrases for a given document. While the task has been comprehensively explored in the past via various models, only a few works offer preliminary analyses of Large Language Models (LLMs) for the task. Given the impact of LLMs in NLP, it is important to examine their potential for keyphrase generation more thoroughly. In this paper, we address this need. Specifically, we focus on the zero-shot capabilities of open-source instruction-tuned LLMs (Phi-3, Llama-3) and the closed-source GPT-4o for this task. We systematically investigate the effect of providing task-relevant specialized instructions in the prompt. Moreover, we design task-specific counterparts to self-consistency-style strategies for LLMs and show significant benefits from our proposals over the baselines.
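
The paper's exact prompts and aggregation rules are given in the full text; as a rough illustration of the multi-sample idea only, the sketch below samples several generations for one document and keeps keyphrases that recur across samples (a simple frequency vote). The prompt wording, the generate_fn wrapper, and the min_votes threshold are illustrative assumptions, not the authors' method.

# Minimal sketch (not the paper's exact prompts or aggregation rule):
# zero-shot keyphrase generation with self-consistency-style multi-sample
# aggregation. `generate_fn` stands in for any instruction-tuned LLM
# (e.g., Phi-3, Llama-3, GPT-4o) sampled with temperature > 0.
from collections import Counter
from typing import Callable, List

PROMPT = (
    "Extract the keyphrases that best summarize the document below. "
    "Return them as a semicolon-separated list.\n\nDocument: {doc}\nKeyphrases:"
)

def parse_keyphrases(text: str) -> List[str]:
    """Split one model response into normalized keyphrase candidates."""
    return [kp.strip().lower() for kp in text.split(";") if kp.strip()]

def aggregate_keyphrases(
    doc: str,
    generate_fn: Callable[[str], str],
    num_samples: int = 5,
    min_votes: int = 2,
) -> List[str]:
    """Sample several generations and keep phrases that recur across samples."""
    votes = Counter()
    for _ in range(num_samples):
        # Count each phrase at most once per sample before voting.
        votes.update(set(parse_keyphrases(generate_fn(PROMPT.format(doc=doc)))))
    # Keep phrases predicted in at least `min_votes` samples, most frequent first.
    return [kp for kp, count in votes.most_common() if count >= min_votes]
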
Anthology ID: 2025.findings-naacl.439
Volume: Findings of the Association for Computational Linguistics: NAACL 2025
Month: April
Year: 2025
Address: Albuquerque, New Mexico
Editors: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 7867–7884
URL: https://preview.aclanthology.org/fix-sig-urls/2025.findings-naacl.439/
Cite (ACL): Jishnu Ray Chowdhury, Jayanth Mohan, Tomas Malik, and Cornelia Caragea. 2025. Zero-Shot Keyphrase Generation: Investigating Specialized Instructions and Multi-sample Aggregation on Large Language Models. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 7867–7884, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal): Zero-Shot Keyphrase Generation: Investigating Specialized Instructions and Multi-sample Aggregation on Large Language Models (Ray Chowdhury et al., Findings 2025)
PDF: https://preview.aclanthology.org/fix-sig-urls/2025.findings-naacl.439.pdf