@inproceedings{gupta-etal-2021-effect,
title = "The Effect of Pretraining on Extractive Summarization for Scientific Documents",
author = "Gupta, Yash and
Ammanamanchi, Pawan Sasanka and
Bordia, Shikha and
Manoharan, Arjun and
Mittal, Deepak and
Pasunuru, Ramakanth and
Shrivastava, Manish and
Singh, Maneesh and
Bansal, Mohit and
Jyothi, Preethi",
booktitle = "Proceedings of the Second Workshop on Scholarly Document Processing",
month = jun,
year = "2021",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2021.sdp-1.9",
doi = "10.18653/v1/2021.sdp-1.9",
pages = "73--82",
abstract = "Large pretrained models have seen enormous success in extractive summarization tasks. In this work, we investigate the influence of pretraining on a BERT-based extractive summarization system for scientific documents. We derive significant performance improvements using an intermediate pretraining step that leverages existing summarization datasets and report state-of-the-art results on a recently released scientific summarization dataset, SciTLDR. We systematically analyze the intermediate pretraining step by varying the size and domain of the pretraining corpus, changing the length of the input sequence in the target task and varying target tasks. We also investigate how intermediate pretraining interacts with contextualized word embeddings trained on different domains.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="gupta-etal-2021-effect">
<titleInfo>
<title>The Effect of Pretraining on Extractive Summarization for Scientific Documents</title>
</titleInfo>
<name type="personal">
<namePart type="given">Yash</namePart>
<namePart type="family">Gupta</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Pawan</namePart>
<namePart type="given">Sasanka</namePart>
<namePart type="family">Ammanamanchi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Shikha</namePart>
<namePart type="family">Bordia</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Arjun</namePart>
<namePart type="family">Manoharan</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Deepak</namePart>
<namePart type="family">Mittal</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ramakanth</namePart>
<namePart type="family">Pasunuru</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Manish</namePart>
<namePart type="family">Shrivastava</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Maneesh</namePart>
<namePart type="family">Singh</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mohit</namePart>
<namePart type="family">Bansal</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Preethi</namePart>
<namePart type="family">Jyothi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2021-06</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the Second Workshop on Scholarly Document Processing</title>
</titleInfo>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Online</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
<abstract>Large pretrained models have seen enormous success in extractive summarization tasks. In this work, we investigate the influence of pretraining on a BERT-based extractive summarization system for scientific documents. We derive significant performance improvements using an intermediate pretraining step that leverages existing summarization datasets and report state-of-the-art results on a recently released scientific summarization dataset, SciTLDR. We systematically analyze the intermediate pretraining step by varying the size and domain of the pretraining corpus, changing the length of the input sequence in the target task and varying target tasks. We also investigate how intermediate pretraining interacts with contextualized word embeddings trained on different domains.</abstract>
<identifier type="citekey">gupta-etal-2021-effect</identifier>
<identifier type="doi">10.18653/v1/2021.sdp-1.9</identifier>
<location>
<url>https://aclanthology.org/2021.sdp-1.9</url>
</location>
<part>
<date>2021-06</date>
<extent unit="page">
<start>73</start>
<end>82</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T The Effect of Pretraining on Extractive Summarization for Scientific Documents
%A Gupta, Yash
%A Ammanamanchi, Pawan Sasanka
%A Bordia, Shikha
%A Manoharan, Arjun
%A Mittal, Deepak
%A Pasunuru, Ramakanth
%A Shrivastava, Manish
%A Singh, Maneesh
%A Bansal, Mohit
%A Jyothi, Preethi
%S Proceedings of the Second Workshop on Scholarly Document Processing
%D 2021
%8 June
%I Association for Computational Linguistics
%C Online
%F gupta-etal-2021-effect
%X Large pretrained models have seen enormous success in extractive summarization tasks. In this work, we investigate the influence of pretraining on a BERT-based extractive summarization system for scientific documents. We derive significant performance improvements using an intermediate pretraining step that leverages existing summarization datasets and report state-of-the-art results on a recently released scientific summarization dataset, SciTLDR. We systematically analyze the intermediate pretraining step by varying the size and domain of the pretraining corpus, changing the length of the input sequence in the target task and varying target tasks. We also investigate how intermediate pretraining interacts with contextualized word embeddings trained on different domains.
%R 10.18653/v1/2021.sdp-1.9
%U https://aclanthology.org/2021.sdp-1.9
%U https://doi.org/10.18653/v1/2021.sdp-1.9
%P 73-82
Markdown (Informal)
[The Effect of Pretraining on Extractive Summarization for Scientific Documents](https://aclanthology.org/2021.sdp-1.9) (Gupta et al., sdp 2021)
ACL
Yash Gupta, Pawan Sasanka Ammanamanchi, Shikha Bordia, Arjun Manoharan, Deepak Mittal, Ramakanth Pasunuru, Manish Shrivastava, Maneesh Singh, Mohit Bansal, and Preethi Jyothi. 2021. The Effect of Pretraining on Extractive Summarization for Scientific Documents. In Proceedings of the Second Workshop on Scholarly Document Processing, pages 73–82, Online. Association for Computational Linguistics.