Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models

Injae Na, Keonwoong Noh, Woohwan Jung


Abstract
LLM providers typically offer multiple LLM tiers that vary in performance and price. As NLP tasks become more complex and modularized, selecting the suitable LLM tier for each subtask is key to balancing cost and performance. To address this problem, we introduce the LLM Automatic Transmission (LLM-AT) framework, which automatically selects LLM tiers without training. LLM-AT consists of a Starter, a Generator, and a Judge. The Starter selects the initial LLM tier expected to solve the given question, the Generator produces a response using the LLM of the selected tier, and the Judge evaluates the validity of the response. If the response is invalid, LLM-AT iteratively upgrades to a higher-tier model, generates a new response, and re-evaluates until a valid response is obtained. Additionally, we propose an accuracy estimator, which enables selection of a suitable initial LLM tier without training. Given an input question, the accuracy estimator estimates the expected accuracy of each LLM tier by computing the valid-response rate across the top-k most similar queries among past inference records. Experiments demonstrate that LLM-AT achieves superior performance while reducing costs, making it a practical solution for real-world applications.
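The escalation loop and the accuracy estimator described in the abstract can be sketched in a few lines of Python. This is a minimal illustrative sketch, not the authors' implementation: the `generate` and `judge` callables, the cosine-similarity embedding space, and the `threshold` and `k` parameters are all assumptions introduced here for clarity.

```python
# Illustrative sketch of the LLM-AT loop (hypothetical names, not the paper's code).
# Assumed components: an ordered list of tiers (cheapest first), a generate()
# call per tier, a judge() validity check, and a Starter that picks the initial
# tier from estimated per-tier accuracy over past inference records.

from dataclasses import dataclass

@dataclass
class Record:
    """One past inference: query embedding, tier used, and whether it was valid."""
    embedding: list[float]
    tier: int
    valid: bool

def cosine(a: list[float], b: list[float]) -> float:
    # Cosine similarity in an assumed embedding space.
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(x * x for x in b) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

def estimate_accuracy(query_emb, history: list[Record], tier: int, k: int = 5) -> float:
    # Expected accuracy of a tier = valid-response rate over the top-k
    # past queries most similar to the input, answered by that tier.
    tier_records = [r for r in history if r.tier == tier]
    top_k = sorted(tier_records,
                   key=lambda r: cosine(query_emb, r.embedding),
                   reverse=True)[:k]
    return sum(r.valid for r in top_k) / len(top_k) if top_k else 0.0

def llm_at(question, query_emb, tiers, generate, judge, history, threshold=0.8):
    # Starter: cheapest tier whose estimated accuracy clears the threshold
    # (falls back to the top tier if none does).
    start = next((t for t in range(len(tiers))
                  if estimate_accuracy(query_emb, history, t) >= threshold),
                 len(tiers) - 1)
    # Generator + Judge: escalate to higher tiers until a response is judged valid.
    for tier in range(start, len(tiers)):
        response = generate(tiers[tier], question)
        if judge(question, response):
            return response, tier
    return response, len(tiers) - 1  # top tier's answer if nothing passed the Judge
```

How the threshold trades off cost against accuracy, and how the Judge is instantiated, are details of the paper; here they are left as free parameters of the sketch.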
Anthology ID:
2025.findings-acl.873
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
16987–17004
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.findings-acl.873/
Cite (ACL):
Injae Na, Keonwoong Noh, and Woohwan Jung. 2025. Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2025, pages 16987–17004, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models (Na et al., Findings 2025)
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.findings-acl.873.pdf