SAJA: A Simple Approach to Judge Alignment for LLM-as-a-Judge
Sneha Kola, Pankaj Kumar Sharma, Soumyadeep Dey, Bamdev Mishra, Mayur Datar
Abstract
LLM-as-a-Judge systems are increasingly used to evaluate text at scale, yet production deployment demands low latency, minimal cost, and compatibility with closed-source APIs. Current approaches fall short in different ways: some require many LLM calls and per-dataset prompt tuning, others depend on logit access unavailable in commercial APIs, and yet others demand multiple rounds of LLM interaction for iterative feature discovery. We present **SAJA** (**S**imple **A**pproach to **J**udge **A**lignment), built on the principle that task-specific alignment should reside in a lightweight calibration head, not in elaborate prompts or model internals. SAJA makes exactly one LLM call per item using a fixed structured rubric prompt, extracts a multi-dimensional feature vector, and maps it to a human-aligned score via a calibration head trained on a small number of human labels. No iterative prompt search, no logit access, and no multi-round LLM interaction are needed. Yet SAJA matches far more complex systems across four evaluation paradigms: 86% F1 on MT-Bench pairwise preference (vs. 78% uncalibrated), competitive performance on five classification benchmarks with a single call, and +5.71% F1 over prompt-optimized baselines on proprietary data. Ablations confirm that multi-dimensional rubric features outperform one-dimensional calibration (SummEval 𝜌 improves from 0.60 to 0.74) and that coarse rubric outputs recover the same human alignment as full logit distributions (𝜌 = 0.36 vs. 0.37), establishing that logit access is unnecessary for calibrated judge alignment. Moreover, SAJA is model-agnostic: a 9B open-source model with SAJA (𝜌=0.70) surpasses raw GPT-4.1 (𝜌=0.60). Its single-call design yields up to 4.8× cost savings over per-question approaches.- Anthology ID:
- 2026.acl-industry.45
- Volume:
- Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, USA
- Editors:
- Yunyao Li, Georg Rehm, Mei Tu
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 646–664
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.acl-industry.45/
- DOI:
- Cite (ACL):
- Sneha Kola, Pankaj Kumar Sharma, Soumyadeep Dey, Bamdev Mishra, and Mayur Datar. 2026. SAJA: A Simple Approach to Judge Alignment for LLM-as-a-Judge. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 646–664, San Diego, California, USA. Association for Computational Linguistics.
- Cite (Informal):
- SAJA: A Simple Approach to Judge Alignment for LLM-as-a-Judge (Kola et al., ACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.acl-industry.45.pdf