SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

Tu Vu, Brian Lester, Noah Constant, Rami Al-Rfou’, Daniel Cer


Abstract
There has been growing interest in parameter-efficient methods to apply pre-trained language models to downstream tasks. Building on the Prompt Tuning approach of Lester et al. (2021), which learns task-specific soft prompts to condition a frozen pre-trained model to perform different tasks, we propose a novel prompt-based transfer learning approach called SPoT: Soft Prompt Transfer. SPoT first learns a prompt on one or more source tasks and then uses it to initialize the prompt for a target task. We show that SPoT significantly boosts the performance of Prompt Tuning across many tasks. More remarkably, across all model sizes, SPoT matches or outperforms standard Model Tuning (which fine-tunes all model parameters) on the SuperGLUE benchmark, while using up to 27,000× fewer task-specific parameters. To understand where SPoT is most effective, we conduct a large-scale study on task transferability with 26 NLP tasks in 160 combinations, and demonstrate that many tasks can benefit each other via prompt transfer. Finally, we propose an efficient retrieval approach that interprets task prompts as task embeddings to identify similar tasks and predict the most transferable source tasks for a novel target task.
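As a rough illustration of the recipe the abstract describes, the plain-NumPy sketch below shows the two SPoT stages (learn a prompt on a source task, then use it to initialize the target task's prompt) and the prompt-as-task-embedding retrieval idea. This is not the authors' released code: train_prompt is a hypothetical stub standing in for actual prompt tuning of a frozen model, the prompt length and embedding size are assumptions, and average-pooled cosine similarity is one simple choice of prompt similarity.

import numpy as np

rng = np.random.default_rng(0)
PROMPT_LEN, EMB_DIM = 100, 1024  # assumed prompt length / embedding size

def train_prompt(init_prompt, task_data):
    """Tune only the (PROMPT_LEN, EMB_DIM) prompt matrix; the pre-trained
    model stays frozen. Real training would backprop a task loss into
    `prompt`; this stub just returns a copy."""
    prompt = init_prompt.copy()
    # ... gradient updates on `prompt` only; model weights untouched ...
    return prompt

# Step 1: learn a prompt on one or more source tasks.
source_prompt = train_prompt(rng.normal(size=(PROMPT_LEN, EMB_DIM)),
                             task_data="source")

# Step 2: initialize the target task's prompt from the source prompt,
# then continue prompt tuning on the target task.
target_prompt = train_prompt(source_prompt, task_data="target")

# Retrieval: treat each learned prompt as a task embedding and rank
# candidate source tasks by similarity to the target task's prompt.
def task_similarity(p1, p2):
    # Cosine similarity between prompts average-pooled over prompt tokens.
    e1, e2 = p1.mean(axis=0), p2.mean(axis=0)
    return float(e1 @ e2 / (np.linalg.norm(e1) * np.linalg.norm(e2)))

library = {"source_a": source_prompt,
           "source_b": train_prompt(rng.normal(size=(PROMPT_LEN, EMB_DIM)), "b")}
ranked = sorted(library, key=lambda t: task_similarity(library[t], target_prompt),
                reverse=True)
print(ranked)  # most transferable (by this proxy) source tasks first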
Anthology ID:
2022.acl-long.346
Volume:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
5039–5059
URL:
https://aclanthology.org/2022.acl-long.346
DOI:
10.18653/v1/2022.acl-long.346
Cite (ACL):
Tu Vu, Brian Lester, Noah Constant, Rami Al-Rfou’, and Daniel Cer. 2022. SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5039–5059, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer (Vu et al., ACL 2022)
PDF:
https://preview.aclanthology.org/naacl24-info/2022.acl-long.346.pdf
Video:
https://preview.aclanthology.org/naacl24-info/2022.acl-long.346.mp4
Data
BoolQ, C4, COPA, CoLA, CosmosQA, CxC, DROP, GEM, GLUE, HellaSwag, MRPC, MRQA, MultiNLI, MultiRC, QNLI, RACE, Rainbow, ReCoRD, SQuAD, SST, SST-2, WSC, WiC, WinoGrande