Adding Argumentation into Human Evaluation of Long Document Abstractive Summarization: A Case Study on Legal Opinions

Mohamed Elaraby; Huihui Xu; Morgan Gray; Kevin D. Ashley; Diane Litman

Adding Argumentation into Human Evaluation of Long Document Abstractive Summarization: A Case Study on Legal Opinions

Mohamed Elaraby, Huihui Xu, Morgan Gray, Kevin Ashley, Diane Litman

Abstract

Human evaluation remains the gold standard for assessing abstractive summarization. However, current practices often prioritize constructing evaluation guidelines for fluency, coherence, and factual accuracy, overlooking other critical dimensions. In this paper, we investigate argument coverage in abstractive summarization by focusing on long legal opinions, where summaries must effectively encapsulate the document’s argumentative nature. We introduce a set of human-evaluation guidelines to evaluate generated summaries based on argumentative coverage. These guidelines enable us to assess three distinct summarization models, studying the influence of including argument roles in summarization. Furthermore, we utilize these evaluation scores to benchmark automatic summarization metrics against argument coverage, providing insights into the effectiveness of automated evaluation methods.

Anthology ID:: 2024.humeval-1.3
Volume:: Proceedings of the Fourth Workshop on Human Evaluation of NLP Systems (HumEval) @ LREC-COLING 2024
Month:: May
Year:: 2024
Address:: Torino, Italia
Editors:: Simone Balloccu, Anya Belz, Rudali Huidrom, Ehud Reiter, Joao Sedoc, Craig Thomson
Venues:: HumEval | WS
SIG:
Publisher:: ELRA and ICCL
Note:
Pages:: 28–35
Language:
URL:: https://preview.aclanthology.org/nschneid-patch-2/2024.humeval-1.3/
DOI:
Bibkey:
Cite (ACL):: Mohamed Elaraby, Huihui Xu, Morgan Gray, Kevin Ashley, and Diane Litman. 2024. Adding Argumentation into Human Evaluation of Long Document Abstractive Summarization: A Case Study on Legal Opinions. In Proceedings of the Fourth Workshop on Human Evaluation of NLP Systems (HumEval) @ LREC-COLING 2024, pages 28–35, Torino, Italia. ELRA and ICCL.
Cite (Informal):: Adding Argumentation into Human Evaluation of Long Document Abstractive Summarization: A Case Study on Legal Opinions (Elaraby et al., HumEval 2024)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-2/2024.humeval-1.3.pdf

PDF Cite Search Fix data