LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering
Patrick Sutanto, Joan Santoso, Esther Irawati Setiawan, Aji Prasetya Wibawa
Abstract
Encoder models offer efficiency for specific tasks, but their performance depends on data availability. While Large Language Models (LLMs) excel at few-shot learning, their direct application in real-world scenarios is often hindered by their high computational cost. To address this challenge, we propose a simple yet effective approach that uses LLMs for data generation and scoring to improve the performance of encoder-only models. We evaluate this framework on few-shot Multiple Choice Question Answering (MCQA), an important task where acquiring labeled data is costly. Our approach uses LLMs to create MCQA questions and choices (exploring both direct JSON and decomposed generation methods) and to assign probability scores to these choices. The generated data and the LLM scores are then used to fine-tune a smaller, more efficient DeBERTa-v3-base model with a distillation loss. Extensive experiments on the MMLU benchmark demonstrate that our method improves accuracy from 28.9% to 39.3%, a gain of over 10 percentage points compared to a baseline fine-tuned directly on 5-shot examples. This shows the effectiveness of LLM-driven data generation and knowledge distillation for few-shot MCQA.
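To illustrate the distillation idea described in the abstract, the sketch below shows how LLM-assigned choice scores could be softened into a target distribution and distilled into DeBERTa-v3-base with a KL-divergence loss. This is a minimal illustration, not the authors' released code: the temperature value, the example question, and the teacher scores are assumptions made here for demonstration.

```python
# Minimal sketch of LLM-to-encoder distillation for MCQA (illustrative only;
# the temperature and example data below are assumptions, not from the paper).
import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModelForMultipleChoice

tokenizer = AutoTokenizer.from_pretrained("microsoft/deberta-v3-base")
model = AutoModelForMultipleChoice.from_pretrained("microsoft/deberta-v3-base")

def distillation_step(question, choices, teacher_scores, temperature=2.0):
    """One training step on a single LLM-generated MCQA item.

    teacher_scores: per-choice scores produced by the LLM (e.g. log-probabilities),
    softened into a target distribution for the student encoder.
    """
    # Encode (question, choice) pairs; unsqueeze to (1, num_choices, seq_len)
    # as expected by AutoModelForMultipleChoice.
    enc = tokenizer([question] * len(choices), choices,
                    padding=True, truncation=True, return_tensors="pt")
    enc = {k: v.unsqueeze(0) for k, v in enc.items()}

    student_logits = model(**enc).logits  # shape: (1, num_choices)

    teacher = torch.tensor(teacher_scores, dtype=torch.float).unsqueeze(0)
    teacher_probs = F.softmax(teacher / temperature, dim=-1)
    student_log_probs = F.log_softmax(student_logits / temperature, dim=-1)

    # KL-divergence distillation loss between the LLM (teacher) distribution
    # and the encoder (student) distribution, rescaled by temperature^2.
    loss = F.kl_div(student_log_probs, teacher_probs, reduction="batchmean")
    return loss * temperature ** 2

# Hypothetical LLM-generated item with LLM-assigned choice scores.
loss = distillation_step(
    "Which planet is known as the Red Planet?",
    ["Venus", "Mars", "Jupiter", "Saturn"],
    teacher_scores=[-3.2, -0.1, -4.0, -4.5],
)
loss.backward()
```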
- Anthology ID: 2025.findings-emnlp.452
- Volume: Findings of the Association for Computational Linguistics: EMNLP 2025
- Month: November
- Year: 2025
- Address: Suzhou, China
- Editors: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
- Venue: Findings
- Publisher: Association for Computational Linguistics
- Pages: 8502–8530
- URL: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.452/
- DOI: 10.18653/v1/2025.findings-emnlp.452
- Cite (ACL): Patrick Sutanto, Joan Santoso, Esther Irawati Setiawan, and Aji Prasetya Wibawa. 2025. LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 8502–8530, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal): LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering (Sutanto et al., Findings 2025)
- PDF: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.452.pdf