Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function
Mingyang Song, Haiyun Jiang, Lemao Liu, Shuming Shi, Liping Jing
Abstract
We create a paradigm shift concerning building unsupervised keyphrase extraction systems in this paper. Instead of modeling the relevance between an individual candidate phrase and the document as in the commonly used framework, we formulate the unsupervised keyphrase extraction task as a document-set matching problem from a set-wise perspective, in which the document and the candidate set are globally matched in the semantic space to particularly take into account the interactions among all candidate phrases. Since it is intractable to exactly extract the keyphrase set by the matching function during the inference, we propose an approximate approach, which obtains the candidate subsets via a set extractor agent learned by reinforcement learning. Exhaustive experimental results demonstrate the effectiveness of our model, which outperforms the recent state-of-the-art unsupervised keyphrase extraction baselines by a large margin.- Anthology ID:
- 2023.findings-acl.156
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2023
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2482–2494
- Language:
- URL:
- https://aclanthology.org/2023.findings-acl.156
- DOI:
- 10.18653/v1/2023.findings-acl.156
- Cite (ACL):
- Mingyang Song, Haiyun Jiang, Lemao Liu, Shuming Shi, and Liping Jing. 2023. Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function. In Findings of the Association for Computational Linguistics: ACL 2023, pages 2482–2494, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function (Song et al., Findings 2023)
- PDF:
- https://preview.aclanthology.org/emnlp22-frontmatter/2023.findings-acl.156.pdf