Let The Jury Decide: Fair Demonstration Selection for In-Context Learning through Incremental Greedy Evaluation
Sadaf Md Halim | Chen Zhao | Xintao Wu | Latifur Khan | Christan Grant | Fariha Ishrat Rahman | Feng Chen
Findings of the Association for Computational Linguistics: ACL 2025
Large Language Models (LLMs) are powerful in-context learners, achieving strong performance with just a few high-quality demonstrations. However, fairness concerns arise in many in-context classification tasks, especially when predictions involve sensitive attributes. To address this, we propose JUDGE, a simple yet effective framework for selecting fair and representative demonstrations that improve group fairness in in-context learning. JUDGE constructs the demonstration set iteratively using a greedy approach, guided by a small, carefully selected jury set. Our method remains robust across varying LLM architectures and datasets, yielding consistent fairness improvements. We evaluate JUDGE on four datasets using four LLMs, comparing it against seven baselines. Results show that JUDGE consistently improves fairness metrics without compromising accuracy.
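The abstract describes an iterative, greedy construction of the demonstration set, scored against a small jury set. The sketch below illustrates one way such a loop could look; it is not the authors' implementation. The `llm_predict` callable, the `(text, label, group)` jury format, the demographic-parity gap as the fairness measure, and the trade-off weight `lam` are all illustrative assumptions.

```python
# A minimal sketch of jury-guided greedy demonstration selection.
# Assumptions (not from the paper): `llm_predict(demos, text)` is a
# hypothetical callable returning a 0/1 prediction; jury entries are
# (text, label, group) triples with binary labels and groups; fairness
# is a demographic-parity gap; `lam` weights the fairness penalty.
from typing import Callable, List, Tuple

Example = Tuple[str, int, int]  # (text, label, sensitive group)

def fairness_gap(preds: List[int], groups: List[int]) -> float:
    """Demographic-parity gap: |P(yhat=1 | g=0) - P(yhat=1 | g=1)|."""
    rates = []
    for g in (0, 1):
        members = [p for p, gi in zip(preds, groups) if gi == g]
        rates.append(sum(members) / max(len(members), 1))
    return abs(rates[0] - rates[1])

def jury_score(preds: List[int], jury: List[Example], lam: float) -> float:
    """Higher is better: jury accuracy minus a weighted fairness penalty."""
    labels = [y for _, y, _ in jury]
    groups = [g for _, _, g in jury]
    acc = sum(p == y for p, y in zip(preds, labels)) / len(labels)
    return acc - lam * fairness_gap(preds, groups)

def greedy_select(
    candidates: List[Example],
    jury: List[Example],
    llm_predict: Callable[[List[Example], str], int],
    k: int = 4,
    lam: float = 1.0,
) -> List[Example]:
    """Grow the demonstration set one demo per round, keeping the
    candidate whose addition maximizes the jury score."""
    selected: List[Example] = []
    pool = list(candidates)
    for _ in range(k):
        best, best_score = None, float("-inf")
        for cand in pool:
            trial = selected + [cand]
            preds = [llm_predict(trial, text) for text, _, _ in jury]
            score = jury_score(preds, jury, lam)
            if score > best_score:
                best, best_score = cand, score
        selected.append(best)
        pool.remove(best)
    return selected
```

Because each round re-scores every remaining candidate on the full jury set, the loop costs O(k · |candidates| · |jury|) LLM calls, which is why the abstract stresses that the jury set is small and carefully selected.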