Active Learning for Abstractive Text Summarization via LLM-Determined Curriculum and Certainty Gain Maximization

Dongyuan Li; Ying Zhang; Zhen Wang; Shiyin Tan; Satoshi Kosugi; Manabu Okumura

doi:10.18653/v1/2024.findings-emnlp.523

Abstract

For abstractive text summarization, laborious data annotation and time-consuming model training are two major obstacles to further progress. Active learning, which selects a few informative instances for annotation and model training, offers a way to address both. However, only a few active learning studies focus on abstractive text summarization, and those that do suffer from low stability, effectiveness, and efficiency. To solve these problems, we propose a novel LLM-determined curriculum active learning framework. First, we design a prompt that asks large language models to rate the difficulty of instances, which guides the model to train on instances in order from easier to harder. Second, we design a novel active learning strategy, Certainty Gain Maximization, which selects instances whose distribution aligns well with the overall distribution. Experiments show that our method can improve the stability, effectiveness, and efficiency of abstractive text summarization backbones.

Anthology ID: 2024.findings-emnlp.523
Volume: Findings of the Association for Computational Linguistics: EMNLP 2024
Month: November
Year: 2024
Address: Miami, Florida, USA
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 8959–8971
URL: https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.findings-emnlp.523/
DOI: 10.18653/v1/2024.findings-emnlp.523
Cite (ACL): Dongyuan Li, Ying Zhang, Zhen Wang, Shiyin Tan, Satoshi Kosugi, and Manabu Okumura. 2024. Active Learning for Abstractive Text Summarization via LLM-Determined Curriculum and Certainty Gain Maximization. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 8959–8971, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal): Active Learning for Abstractive Text Summarization via LLM-Determined Curriculum and Certainty Gain Maximization (Li et al., Findings 2024)
PDF: https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.findings-emnlp.523.pdf
