Abstract
Coherence is a crucial feature of text because it is indispensable for conveying its communication purpose and meaning to its readers. In this paper, we propose an unsupervised text coherence scoring based on graph construction in which edges are established between semantically similar sentences represented by vertices. The sentence similarity is calculated based on the cosine similarity of semantic vectors representing sentences. We provide three graph construction methods establishing an edge from a given vertex to a preceding adjacent vertex, to a single similar vertex, or to multiple similar vertices. We evaluated our methods in the document discrimination task and the insertion task by comparing our proposed methods to the supervised (Entity Grid) and unsupervised (Entity Graph) baselines. In the document discrimination task, our method outperformed the unsupervised baseline but could not do the supervised baseline, while in the insertion task, our method outperformed both baselines.- Anthology ID:
- W17-2410
- Volume:
- Proceedings of TextGraphs-11: the Workshop on Graph-based Methods for Natural Language Processing
- Month:
- August
- Year:
- 2017
- Address:
- Vancouver, Canada
- Editors:
- Martin Riedl, Swapna Somasundaran, Goran Glavaš, Eduard Hovy
- Venue:
- TextGraphs
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 76–85
- Language:
- URL:
- https://aclanthology.org/W17-2410
- DOI:
- 10.18653/v1/W17-2410
- Cite (ACL):
- Jan Wira Gotama Putra and Takenobu Tokunaga. 2017. Evaluating text coherence based on semantic similarity graph. In Proceedings of TextGraphs-11: the Workshop on Graph-based Methods for Natural Language Processing, pages 76–85, Vancouver, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Evaluating text coherence based on semantic similarity graph (Putra & Tokunaga, TextGraphs 2017)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/W17-2410.pdf