PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models

ChenZhuo Zhao, Ziqian Liu, Xinda Wang, Junting Lu, Chaoyi Ruan


Abstract
Prompt optimization is a practical and widely applicable alternative to fine-tuning for improving large language model performance. Yet many existing methods evaluate candidate prompts by sampling full outputs, often coupled with self-critique or human-annotated preferences, which limits scalability, especially for smaller models or models that are not instruction-tuned. We present PMPO (Probabilistic Metric Prompt Optimization), a unified framework that uses token-level cross-entropy as a direct, lightweight evaluation signal. PMPO locates low-quality prompt segments via a masking-based analysis and iteratively rewrites them to propose improved variants. Crucially, during evaluation, PMPO selects among variants by minimizing loss in a single forward pass, eliminating output sampling and human- or judge-based scoring for selection while still using standard generation only to propose rewrites. This unified, loss-based strategy supports both supervised and preference-based tasks. Across model sizes and datasets, PMPO outperforms prior prompt optimizers: it achieves the highest average accuracy on BBH, performs strongly on GSM8K and AQuA-RAT, and raises AlpacaEval 2.0 win rates by over 19 points. These results demonstrate PMPO’s effectiveness, efficiency, and broad applicability.
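The selection step described in the abstract, choosing the prompt variant whose token-level cross-entropy on the target output is lowest, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names and the toy log-probabilities are assumptions, and in practice the per-token log-probabilities would come from a single forward pass of the language model for each candidate prompt.

```python
import math

def cross_entropy_loss(token_logprobs):
    """Mean negative log-likelihood over the target tokens."""
    return -sum(token_logprobs) / len(token_logprobs)

def select_prompt(candidates):
    """Pick the candidate prompt with the lowest token-level cross-entropy.

    candidates: dict mapping prompt text -> list of log-probabilities
    that the model (conditioned on that prompt) assigns to the target
    output tokens. No output sampling or judge scoring is needed.
    """
    return min(candidates, key=lambda p: cross_entropy_loss(candidates[p]))

# Toy log-probabilities; real values come from a model forward pass.
scores = {
    "prompt A": [math.log(0.5), math.log(0.4)],
    "prompt B": [math.log(0.8), math.log(0.7)],
}
best = select_prompt(scores)  # "prompt B": higher target likelihood, lower loss
```

Because selection reduces to comparing scalar losses, it scales to small or non-instruction-tuned models that would struggle to produce reliable self-critiques.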
Anthology ID:
2025.findings-emnlp.795
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2025
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
14728–14761
URL:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.795/
DOI:
10.18653/v1/2025.findings-emnlp.795
Cite (ACL):
ChenZhuo Zhao, Ziqian Liu, Xinda Wang, Junting Lu, and Chaoyi Ruan. 2025. PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 14728–14761, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models (Zhao et al., Findings 2025)
PDF:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.795.pdf
Checklist:
2025.findings-emnlp.795.checklist.pdf