Biased TextRank: Unsupervised Graph-Based Content Extraction

Ashkan Kazemi, Verónica Pérez-Rosas, Rada Mihalcea


Abstract
We introduce Biased TextRank, a graph-based content extraction method inspired by the popular TextRank algorithm that ranks text spans according to their importance for language processing tasks and according to their relevance to an input “focus.” Biased TextRank enables focused content extraction for text by modifying the random restarts in the execution of TextRank. The random restart probabilities are assigned based on the relevance of the graph nodes to the focus of the task. We present two applications of Biased TextRank: focused summarization and explanation extraction, and show that our algorithm leads to improved performance on two different datasets by significant ROUGE-N score margins. Much like its predecessor, Biased TextRank is unsupervised, easy to implement and orders of magnitude faster and lighter than current state-of-the-art Natural Language Processing methods for similar tasks.
Anthology ID:
2020.coling-main.144
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
1642–1652
Language:
URL:
https://aclanthology.org/2020.coling-main.144
DOI:
10.18653/v1/2020.coling-main.144
Bibkey:
Cite (ACL):
Ashkan Kazemi, Verónica Pérez-Rosas, and Rada Mihalcea. 2020. Biased TextRank: Unsupervised Graph-Based Content Extraction. In Proceedings of the 28th International Conference on Computational Linguistics, pages 1642–1652, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
Biased TextRank: Unsupervised Graph-Based Content Extraction (Kazemi et al., COLING 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/update-css-js/2020.coling-main.144.pdf