@inproceedings{ma-etal-2025-prad,
title = "{P}r{A}d: Prompt Adaptive Tuning for Decoder-only Language Models",
author = "Ma, Youneng and
He, Junyi and
Fei, Haojun",
editor = "Christodoulopoulos, Christos and
Chakraborty, Tanmoy and
Rose, Carolyn and
Peng, Violet",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2025",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.254/",
doi = "10.18653/v1/2025.findings-emnlp.254",
pages = "4729--4743",
ISBN = "979-8-89176-335-7",
abstract = "Fine tuning pretrained language models for downstream NLP tasks, while effective, can be costly when the model size and the number of tasks increase, as it requires full parameter updates and a separate model served for each task. Parameter-efficient tuning (PET) addresses the issue by keeping the pretrained parameters fixed while introducing minimal task-specific parameters. There are two essential PET paradigms: prompt-based tuning and adapter-based tuning, each with distinct limitations. Prompt-based methods suffer from increased input lengths and sensitivity to weight initialization, whereas adapter approaches can substantially increase inference time. To overcome these limitations, we propose prompt adaptive tuning (PrAd), a general prompt-based tuning framework for decode-only models that delivers strong performance with high efficiency, even in multi-task scenarios. Unlike conventional prompt-based tuning which uses soft tokens to ``wrap'' inputs, PrAd employs adapters for flexible input transformation. While traditional adapter-based tuning adapts both the prompt and decoded tokens, PrAd only adapts the prompt. PrAd enables the creation of diverse prompt-based approaches while providing critical advantages for real-world use: (1) it can maintain original input lengths with easy initialization during training, like adapter-based methods; (2) it can reduce management costs while facilitating deployment and efficient batch inference of different tasks, like prompt-based tuning.; and (3) it introduces no additional inference latency in the decoding phase even when serving multiple tasks concurrently. Experiments on six diverse tasks demonstrate that PrAd can consistently attain comparable or better performance and higher inference efficiency."
}

Markdown (Informal)
[PrAd: Prompt Adaptive Tuning for Decoder-only Language Models](https://aclanthology.org/2025.findings-emnlp.254/) (Ma et al., Findings 2025)
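As a rough illustration of the mechanism the abstract describes (an adapter that transforms only the prompt positions of a decoder-only model, so decoded tokens pass through unchanged and decoding incurs no extra latency), here is a minimal, hypothetical PyTorch sketch. The class name `PromptAdapter`, the bottleneck size, and the zero-initialization are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch (not the paper's code): a bottleneck adapter applied only
# to the prompt positions of the hidden states, per the idea in the abstract.
import torch
import torch.nn as nn

class PromptAdapter(nn.Module):
    def __init__(self, hidden_size: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)
        self.up = nn.Linear(bottleneck, hidden_size)
        # Zero-init the up-projection so the adapter starts as an identity
        # residual, giving the "easy initialization" the abstract mentions.
        nn.init.zeros_(self.up.weight)
        nn.init.zeros_(self.up.bias)

    def forward(self, hidden: torch.Tensor, prompt_len: int) -> torch.Tensor:
        # hidden: (batch, seq_len, hidden_size). Only the first `prompt_len`
        # positions (the prompt) are adapted; later (decoded) positions are
        # returned untouched, so no latency is added during decoding.
        prompt, rest = hidden[:, :prompt_len], hidden[:, prompt_len:]
        prompt = prompt + self.up(torch.relu(self.down(prompt)))
        return torch.cat([prompt, rest], dim=1)
```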