MAPLE: Multi-Aspect Panels of LLM Evaluators for Open-Ended Questions

Michinori Jinji, Kyohei Atarashi, Koh Takeuchi, Hisashi Kashima


Abstract
LLM-as-a-Judge, which uses LLMs to evaluate responses to open-ended questions, has seen significant growth in recent years. It has been adopted as a scalable alternative to manual human evaluation, such as crowdsourcing, which is often time-consuming and costly. However, the discrepancy between LLM-generated evaluations and human evaluations remains a critical problem in this field. To bridge this gap, we propose Multi-Aspect Panels of LLM Evaluators (MAPLE), a framework that orchestrates evaluations across multiple criteria using multiple LLMs. MAPLE integrates criterion-wise pairwise evaluations from multiple LLMs by estimating the importance of criteria and the reliability of individual evaluators. We conduct experiments with both open-source and closed-source models. Our results demonstrate that MAPLE achieves superior alignment with human evaluations compared to baselines, highlighting the importance of employing multi-agent and multi-criteria evaluation strategies.
Anthology ID:
2026.findings-acl.1351
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
27071–27088
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1351/
DOI:
Bibkey:
Cite (ACL):
Michinori Jinji, Kyohei Atarashi, Koh Takeuchi, and Hisashi Kashima. 2026. MAPLE: Multi-Aspect Panels of LLM Evaluators for Open-Ended Questions. In Findings of the Association for Computational Linguistics: ACL 2026, pages 27071–27088, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
MAPLE: Multi-Aspect Panels of LLM Evaluators for Open-Ended Questions (Jinji et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1351.pdf
Checklist:
 2026.findings-acl.1351.checklist.pdf