Fact-based Text Editing

Hayate Iso, Chao Qiao, Hang Li


Abstract
We propose a novel text editing task, referred to as fact-based text editing, in which the goal is to revise a given document to better describe the facts in a knowledge base (e.g., several triples). The task is important in practice because reflecting the truth is a common requirement in text editing. First, we propose a method for automatically generating a dataset for research on fact-based text editing, where each instance consists of a draft text, a revised text, and several facts represented in triples. We apply the method into two public table-to-text datasets, obtaining two new datasets consisting of 233k and 37k instances, respectively. Next, we propose a new neural network architecture for fact-based text editing, called FactEditor, which edits a draft text by referring to given facts using a buffer, a stream, and a memory. A straightforward approach to address the problem would be to employ an encoder-decoder model. Our experimental results on the two datasets show that FactEditor outperforms the encoder-decoder approach in terms of fidelity and fluency. The results also show that FactEditor conducts inference faster than the encoder-decoder approach.
Anthology ID:
2020.acl-main.17
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2020
Address:
Online
Editors:
Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
171–182
Language:
URL:
https://preview.aclanthology.org/icon-24-ingestion/2020.acl-main.17/
DOI:
10.18653/v1/2020.acl-main.17
Bibkey:
Cite (ACL):
Hayate Iso, Chao Qiao, and Hang Li. 2020. Fact-based Text Editing. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 171–182, Online. Association for Computational Linguistics.
Cite (Informal):
Fact-based Text Editing (Iso et al., ACL 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/icon-24-ingestion/2020.acl-main.17.pdf
Video:
 http://slideslive.com/38929258
Code
 isomap/factedit
Data
RotoEditWebEdit