Heterogeneous Graph Neural Networks for Keyphrase Generation

Jiacheng Ye, Ruijian Cai, Tao Gui, Qi Zhang


Abstract
The encoder–decoder framework achieves state-of-the-art results in keyphrase generation (KG) tasks by predicting both present keyphrases that appear in the source document and absent keyphrases that do not. However, relying solely on the source document can result in generating uncontrollable and inaccurate absent keyphrases. To address these problems, we propose a novel graph-based method that can capture explicit knowledge from related references. Our model first retrieves some document-keyphrases pairs similar to the source document from a pre-defined index as references. Then a heterogeneous graph is constructed to capture relations with different levels of granularity of the source document and its retrieved references. To guide the decoding process, a hierarchical attention and copy mechanism is introduced, which directly copies appropriate words from both source document and its references based on their relevance and significance. The experimental results on multiple KG benchmarks show that the proposed model achieves significant improvements against other baseline models, especially with regard to the absent keyphrase prediction.
Anthology ID:
2021.emnlp-main.213
Volume:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2021
Address:
Online and Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2705–2715
Language:
URL:
https://aclanthology.org/2021.emnlp-main.213
DOI:
10.18653/v1/2021.emnlp-main.213
Bibkey:
Cite (ACL):
Jiacheng Ye, Ruijian Cai, Tao Gui, and Qi Zhang. 2021. Heterogeneous Graph Neural Networks for Keyphrase Generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 2705–2715, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
Heterogeneous Graph Neural Networks for Keyphrase Generation (Ye et al., EMNLP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2021.emnlp-main.213.pdf
Video:
 https://preview.aclanthology.org/emnlp-22-attachments/2021.emnlp-main.213.mp4
Code
 jiacheng-ye/kg_gater
Data
KP20k