Many-Shot Scaling of In-Context Learning with Self-Generated Demonstrations

Zhengyao Gu, Henry Peng Zou, Yankai Chen, Aiwei Liu, Weizhi Zhang, Philip S. Yu


Abstract
The high cost of obtaining high-quality annotated data for in-context learning (ICL) has motivated the development of methods that use self-generated annotations in place of ground truth labels. While these approaches have shown promising results in few-shot settings, they generally do not scale to many-shot scenarios. In this work, we study ICL with self-generated examples using a framework analogous to traditional semi-supervised learning, consisting of annotation generation, demonstration selection, and in-context inference. Within this framework, we propose a simple baseline that outperforms ground truth ICL under zero-shot, few-shot, and many-shot settings. Notably, we observe consistent scaling behaviors with respect to the number of self-annotated demonstrations. To further extract performance from this many-shot capability, we introduce IterPSD, an iterative self-annotation approach that integrates iterative refinement and curriculum pseudo-labeling techniques from semi-supervised learning, yielding up to 6.8% additional gains on classification tasks. Motivated by our baseline and IterPSD results, we demonstrate that semi-supervised ICL offers a promising avenue for future ICL research.
Anthology ID:
2026.findings-acl.2013
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
40494–40508
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2013/
DOI:
Bibkey:
Cite (ACL):
Zhengyao Gu, Henry Peng Zou, Yankai Chen, Aiwei Liu, Weizhi Zhang, and Philip S. Yu. 2026. Many-Shot Scaling of In-Context Learning with Self-Generated Demonstrations. In Findings of the Association for Computational Linguistics: ACL 2026, pages 40494–40508, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Many-Shot Scaling of In-Context Learning with Self-Generated Demonstrations (Gu et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2013.pdf
Checklist:
 2026.findings-acl.2013.checklist.pdf