Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers

Tuo Zhang; Jinyue Yuan; Salman Avestimehr

Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers

Tuo Zhang, Jinyue Yuan, Salman Avestimehr

Abstract

Numerous recent works aim to enhance the efficacy of Large Language Models (LLMs) through strategic prompting. In particular, the Optimization by PROmpting (OPRO) approach provides state-of-the-art performance by leveraging LLMs as optimizers where the optimization task is to find instructions that maximize the task accuracy. In this paper, we revisit OPRO for automated prompting with relatively small-scale LLMs, such as LLaMa-2 family and Mistral 7B. Our investigation reveals that OPRO shows limited effectiveness in small-scale LLMs, with limited inference capabilities constraining optimization ability. We suggest future automatic prompting engineering to consider both model capabilities and computational costs. Additionally, for small-scale LLMs, we recommend direct instructions that clearly outline objectives and methodologies as robust prompt baselines, ensuring efficient and effective prompt engineering in ongoing research.

Anthology ID:: 2024.findings-acl.100
Volume:: Findings of the Association for Computational Linguistics ACL 2024
Month:: August
Year:: 2024
Address:: Bangkok, Thailand and virtual meeting
Editors:: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1727–1735
Language:
URL:: https://aclanthology.org/2024.findings-acl.100
DOI:
Bibkey:
Cite (ACL):: Tuo Zhang, Jinyue Yuan, and Salman Avestimehr. 2024. Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers. In Findings of the Association for Computational Linguistics ACL 2024, pages 1727–1735, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.
Cite (Informal):: Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers (Zhang et al., Findings 2024)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-4/2024.findings-acl.100.pdf

PDF Search