HFT at BEA 2026 Shared Task 2: Blunt-Edge Models for Hybrid Grading

Ulrike Pado


Abstract
Open-source LLMs with simple, zero-shot prompts are at best middling graders on the BEA 2026 Automated Grading Shared Task – blunt-edge models, in fact. However, they are good enough to support human graders and save them time. We demonstrate the application of a hybrid grading approach that first transparently defines the success criteria and then pairs a zero-shot LLM grader with human review. The hybrid approach outperforms the LLM grader on its own and has the added advantage of keeping the human in the loop.
Anthology ID:
2026.bea-1.86
Volume:
Proceedings of the 21st Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Bashar Alhafni, Stefano Bannò, Marie Bexte, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Anais Tack, Victoria Yaneva, Zheng Yuan
Venues:
BEA | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1193–1200
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bea-1.86/
DOI:
Bibkey:
Cite (ACL):
Ulrike Pado. 2026. HFT at BEA 2026 Shared Task 2: Blunt-Edge Models for Hybrid Grading. In Proceedings of the 21st Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2026), pages 1193–1200, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
HFT at BEA 2026 Shared Task 2: Blunt-Edge Models for Hybrid Grading (Pado, BEA 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bea-1.86.pdf