When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges

Parth Darshan; Abhishek Divekar

When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges

Abstract

Customizing an LLM judge to a specific task or domain often involves optimizing its prompt across multiple evaluation criteria simultaneously. Textual gradient methods automate this for a single judge criterion, however they produce natural-language critiques, not numerical vectors. Thus, the conflict-resolution toolkit of multi-task learning (PCGrad, MGDA) doesn’t apply to the multi-objective textual gradient setting. We test five decomposition modes of textual gradient optimizers by varying how much cross-task information the loss, gradient and optimizer LLMs share. In 6 of 10 configurations on SummEval, we observe that optimization never improves over the initial prompt. Gradient specificity drops by 59% (from 9.0 to 3.7) when the gradient LLM processes multiple criteria jointly. Separately, we observe that naively combining per-task instructions into a single prompt degrades Spearman’s ρ by -5.3%. These results identify two separable failure modes: optimization-time gradient dilution and inference-time instruction interference, which together constrain the design space for multi-objective judge customization using textual feedback.

Anthology ID:: 2026.customnlp4u-1.21
Volume:: Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Sheshera Mysore, Sachin Kumar, Vidhisha Balachandran, Shirley Anugrah Hayati, Faeze Brahman, Hanane Nour Moussa, Alireza Salemi
Venues:: CustomNLP4U | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 216–228
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.customnlp4u-1.21/
DOI:
Bibkey:
Cite (ACL):: Parth Darshan and Abhishek Divekar. 2026. When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges. In Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U), pages 216–228, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges (Darshan & Divekar, CustomNLP4U 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.customnlp4u-1.21.pdf

PDF Cite Search Fix data