Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function

Mingyang Song, Haiyun Jiang, Lemao Liu, Shuming Shi, Liping Jing


Abstract
We create a paradigm shift concerning building unsupervised keyphrase extraction systems in this paper. Instead of modeling the relevance between an individual candidate phrase and the document as in the commonly used framework, we formulate the unsupervised keyphrase extraction task as a document-set matching problem from a set-wise perspective, in which the document and the candidate set are globally matched in the semantic space to particularly take into account the interactions among all candidate phrases. Since it is intractable to exactly extract the keyphrase set by the matching function during the inference, we propose an approximate approach, which obtains the candidate subsets via a set extractor agent learned by reinforcement learning. Exhaustive experimental results demonstrate the effectiveness of our model, which outperforms the recent state-of-the-art unsupervised keyphrase extraction baselines by a large margin.
Anthology ID:
2023.findings-acl.156
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2482–2494
Language:
URL:
https://aclanthology.org/2023.findings-acl.156
DOI:
10.18653/v1/2023.findings-acl.156
Bibkey:
Cite (ACL):
Mingyang Song, Haiyun Jiang, Lemao Liu, Shuming Shi, and Liping Jing. 2023. Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function. In Findings of the Association for Computational Linguistics: ACL 2023, pages 2482–2494, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function (Song et al., Findings 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp22-frontmatter/2023.findings-acl.156.pdf