FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations

Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal


Abstract
Despite recent improvements in abstractive summarization, most current approaches generate summaries that are not factually consistent with the source document, severely restricting their trust and usage in real-world applications. Recent works have shown promising improvements in factuality error identification using text or dependency arc entailments; however, they do not consider the entire semantic graph simultaneously. To this end, we propose FactGraph, a method that decomposes the document and the summary into structured meaning representations (MR), which are more suitable for factuality evaluation. MRs describe core semantic concepts and their relations, aggregating the main content in both document and summary in a canonical form, and reducing data sparsity. FactGraph encodes such graphs using a graph encoder augmented with structure-aware adapters to capture interactions among the concepts based on the graph connectivity, along with text representations using an adapter-based text encoder. Experiments on different benchmarks for evaluating factuality show that FactGraph outperforms previous approaches by up to 15%. Furthermore, FactGraph improves performance on identifying content verifiability errors and better captures subsentence-level factual inconsistencies.
Anthology ID:
2022.naacl-main.236
Volume:
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Marine Carpuat, Marie-Catherine de Marneffe, Ivan Vladimir Meza Ruiz
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3238–3253
Language:
URL:
https://aclanthology.org/2022.naacl-main.236
DOI:
10.18653/v1/2022.naacl-main.236
Bibkey:
Cite (ACL):
Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, and Mohit Bansal. 2022. FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3238–3253, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (Ribeiro et al., NAACL 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl24-info/2022.naacl-main.236.pdf
Software:
 2022.naacl-main.236.software.zip
Video:
 https://preview.aclanthology.org/naacl24-info/2022.naacl-main.236.mp4
Code
 amazon-research/fact-graph +  additional community code
Data
CNN/Daily Mail