Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding

Fanyi Qu; Hao Sun; Yunfang Wu

doi:10.18653/v1/2024.findings-acl.47

Abstract

Within the context of reading comprehension, the task of Distractor Generation (DG) aims to generate several incorrect options to confuse readers. In recent years, the emergence of Large Language Models (LLMs) has opened up the potential for unsupervised DG without expensive human-annotated distractor labels. In this paper, we leverage LLMs as cost-effective annotators to enhance the DG capability of smaller student models. To perform knowledge distillation, we propose a dual-task training framework that integrates pseudo distractors from LLMs and answer information as the objective targets in a two-stage training process. Moreover, we devise a counterfactual contrastive decoding mechanism to increase the distracting capability of the DG model. Experiments show that our unsupervised generation method with Bart-base greatly surpasses GPT-3.5-turbo zero-shot performance with 200× fewer model parameters. Our proposed unsupervised DG method offers a cost-effective framework for practical reading comprehension applications, without the need for laborious distractor annotation or costly large models.
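
The abstract does not spell out the decoding rule, so the following is only a rough sketch of the general idea behind contrastive decoding, not the authors' formulation. It assumes the model is scored twice at each step, once on the real input and once on some counterfactual input (which input is altered, and how, is not stated here), and that tokens favored under the counterfactual input are penalized. The function name `contrastive_step`, the weight `alpha`, and the plausibility threshold `beta` are hypothetical and borrowed from generic contrastive decoding.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def contrastive_step(factual_logits, counterfactual_logits, alpha=0.5, beta=0.1):
    """Pick the next token id for one greedy contrastive decoding step.

    Assumed scoring rule (illustrative, not taken from the paper):
        score(x) = log p_factual(x) - alpha * log p_counterfactual(x)
    Only tokens whose factual probability is at least beta times the
    factual maximum are eligible, so implausible tokens cannot win just
    because the counterfactual pass dislikes them.
    """
    lp_f = log_softmax(factual_logits)
    lp_c = log_softmax(counterfactual_logits)
    cutoff = math.log(beta) + max(lp_f)
    scores = [
        f - alpha * c if f >= cutoff else float("-inf")
        for f, c in zip(lp_f, lp_c)
    ]
    return max(range(len(scores)), key=scores.__getitem__)


# Toy vocabulary of three tokens: the counterfactual pass strongly prefers
# token 0, so the contrastive step shifts the choice to token 1.
factual = [2.0, 1.9, -5.0]
counterfactual = [3.0, 0.0, 0.0]
chosen = contrastive_step(factual, counterfactual)
baseline = contrastive_step(factual, counterfactual, alpha=0.0)  # plain greedy
```

With `alpha=0.0` the step reduces to ordinary greedy decoding on the factual pass, which makes the effect of the contrastive term easy to compare on the same inputs.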

Anthology ID: 2024.findings-acl.47
Volume: Findings of the Association for Computational Linguistics: ACL 2024
Month: August
Year: 2024
Address: Bangkok, Thailand
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 827–838
URL: https://preview.aclanthology.org/build-pipeline-with-new-library/2024.findings-acl.47/
DOI: 10.18653/v1/2024.findings-acl.47
Cite (ACL): Fanyi Qu, Hao Sun, and Yunfang Wu. 2024. Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding. In Findings of the Association for Computational Linguistics: ACL 2024, pages 827–838, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal): Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding (Qu et al., Findings 2024)
PDF: https://preview.aclanthology.org/build-pipeline-with-new-library/2024.findings-acl.47.pdf
