Automatic Summarization for Creative Writing: BART based Pipeline Method for Generating Summary of Movie Scripts

Aditya Upadhyay, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh, Petr Motlicek


Abstract
This paper documents our approach for the Creative-Summ 2022 shared task for Automatic Summarization of Creative Writing. For this purpose, we develop an automatic summarization pipeline where we leverage a denoising autoencoder for pretraining sequence-to-sequence models and fine-tune it on a large-scale abstractive screenplay summarization dataset to summarize TV transcripts from primetime shows. Our pipeline divides the input transcript into smaller conversational blocks, removes redundant text, summarises the conversational blocks, obtains the block-wise summaries, cleans, structures, and then integrates the summaries to create the meeting minutes. Our proposed system achieves some of the best scores across multiple metrics(lexical, semantical) in the Creative-Summ shared task.
Anthology ID:
2022.creativesumm-1.7
Volume:
Proceedings of The Workshop on Automatic Summarization for Creative Writing
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea
Venue:
CreativeSumm
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
44–50
Language:
URL:
https://aclanthology.org/2022.creativesumm-1.7
DOI:
Bibkey:
Cite (ACL):
Aditya Upadhyay, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh, and Petr Motlicek. 2022. Automatic Summarization for Creative Writing: BART based Pipeline Method for Generating Summary of Movie Scripts. In Proceedings of The Workshop on Automatic Summarization for Creative Writing, pages 44–50, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):
Automatic Summarization for Creative Writing: BART based Pipeline Method for Generating Summary of Movie Scripts (Upadhyay et al., CreativeSumm 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.creativesumm-1.7.pdf