EpiCoDe: Boosting Model Performance Beyond Training with Extrapolation and Contrastive Decoding

Mingxu Tao, Jie Hu, Mingchuan Yang, Yunhuai Liu, Dongyan Zhao, Yansong Feng


Abstract
The remarkable performance of large language models (LLMs) relies heavily on the availability of abundant high-quality training data. However, the high cost of acquiring annotated data often prevents models from obtaining the capabilities needed to tackle downstream tasks. In this paper, we introduce EpiCoDe, a novel method that boosts model performance in data-scarcity scenarios without extra training. We first employ model extrapolation to enhance a finetuned model using its inferior version, and then adopt contrastive decoding to further reduce prediction errors by comparing the logit scores given by the extrapolated model and the vanilla finetuned model. Experiments across three domains and four different LLMs show that EpiCoDe consistently outperforms existing methods with significant and robust improvements. We also propose a new theoretical framework that reveals the mechanism behind contrastive decoding in data-scarcity scenarios, which further helps explain the effectiveness of EpiCoDe.
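The abstract does not spell out the exact update rules, so the sketch below only illustrates the two stages it names in one common form: weight-space extrapolation of a finetuned checkpoint away from an inferior one, followed by a contrastive combination of the two models' next-token logits. The coefficients alpha and beta, the specific combination formulas, and the random stand-in logits are illustrative assumptions, not the formulation reported in the paper.

# Minimal PyTorch sketch of the two stages described in the abstract.
# alpha, beta, and the combination rules are assumptions for illustration.
import torch


def extrapolate_state_dict(finetuned: dict, inferior: dict, alpha: float = 0.5) -> dict:
    # Weight-space extrapolation: push the finetuned weights further away
    # from an inferior (e.g. under-trained) checkpoint of the same model.
    return {
        name: finetuned[name] + alpha * (finetuned[name] - inferior[name])
        for name in finetuned
    }


def contrastive_logits(logits_extrapolated: torch.Tensor,
                       logits_finetuned: torch.Tensor,
                       beta: float = 1.0) -> torch.Tensor:
    # Contrastive decoding: amplify what the extrapolated model prefers
    # over the vanilla finetuned model before picking the next token.
    return logits_extrapolated + beta * (logits_extrapolated - logits_finetuned)


if __name__ == "__main__":
    vocab = 8
    torch.manual_seed(0)
    # Stand-ins for the per-step next-token logits of the two models.
    logits_ext = torch.randn(vocab)
    logits_ft = torch.randn(vocab)
    next_token = contrastive_logits(logits_ext, logits_ft).argmax().item()
    print("greedy next token id:", next_token)

In an actual decoding loop, both the extrapolated and the vanilla finetuned model would score the same prefix at every step, and the contrastive scores would replace the raw logits before greedy or sampled selection.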
Anthology ID: 2025.findings-acl.769
Volume: Findings of the Association for Computational Linguistics: ACL 2025
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 14874–14885
URL: https://preview.aclanthology.org/mtsummit-25-ingestion/2025.findings-acl.769/
DOI: 10.18653/v1/2025.findings-acl.769
Cite (ACL): Mingxu Tao, Jie Hu, Mingchuan Yang, Yunhuai Liu, Dongyan Zhao, and Yansong Feng. 2025. EpiCoDe: Boosting Model Performance Beyond Training with Extrapolation and Contrastive Decoding. In Findings of the Association for Computational Linguistics: ACL 2025, pages 14874–14885, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): EpiCoDe: Boosting Model Performance Beyond Training with Extrapolation and Contrastive Decoding (Tao et al., Findings 2025)
PDF: https://preview.aclanthology.org/mtsummit-25-ingestion/2025.findings-acl.769.pdf