KAD: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral

Ayoub Hammal, Pierre Zweigenbaum, Caio Corro


Abstract
Several previous works concluded that most of the generation capabilities of large language models (LLMs) are learned (early) during pre-training. However, LLMs still require further alignment to adhere to downstream task requirements and stylistic preferences, among other desired properties. As LLMs continue to scale in size, the computational cost of alignment procedures increases prohibitively. In this work, we propose a novel approach to circumvent these costs via proxy-based test-time alignment, i.e., using guidance from a small aligned model. Our approach can be described as a token-specific cascading method, in which the token-specific deferral rule is reduced to a 0-1 knapsack problem. In this setting, we derive primal and dual approximations of the optimal deferral decision. We experimentally demonstrate the benefits of our method in both task performance and speculative decoding speed.
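The knapsack-style deferral decision described in the abstract can be illustrated with a small sketch. The following is not the authors' implementation: it assumes per-token deferral gains and costs are already estimated, and it uses the classic density-greedy heuristic as a primal approximation to 0-1 knapsack selection; all names (greedy_deferral, gains, costs, budget) are hypothetical.

from typing import List

def greedy_deferral(gains: List[float], costs: List[float], budget: float) -> List[int]:
    # gains[i]: estimated alignment benefit of deferring token i to the large model.
    # costs[i]: compute cost of querying the large model at position i.
    # budget:   total deferral compute budget (the knapsack capacity).
    # Sort positions by benefit-per-cost density, then fill the budget greedily;
    # this is a standard primal approximation to the 0-1 knapsack problem.
    order = sorted(range(len(gains)), key=lambda i: gains[i] / costs[i], reverse=True)
    chosen, spent = [], 0.0
    for i in order:
        if spent + costs[i] <= budget:
            chosen.append(i)
            spent += costs[i]
    return chosen

# A dual (Lagrangian) view would instead defer token i whenever gains[i] / costs[i]
# exceeds a fixed multiplier, avoiding the global sort over positions.
print(greedy_deferral([0.9, 0.2, 0.7], [1.0, 1.0, 2.0], budget=2.0))  # -> [0, 1]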
Anthology ID:
2026.eacl-long.179
Volume:
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:
EACL
Publisher:
Association for Computational Linguistics
Pages:
3854–3872
URL:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.179/
Cite (ACL):
Ayoub Hammal, Pierre Zweigenbaum, and Caio Corro. 2026. KAD: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3854–3872, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
KAD: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral (Hammal et al., EACL 2026)
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.179.pdf