Leveraging Moment Injection for Enhanced Semi-supervised Natural Language Inference with Large Language Models

Seo Yeon Park


Abstract
Natural Language Inference (NLI) is crucial for evaluating models’ Natural Language Understanding (NLU) and reasoning abilities. The development of NLI, in part, has been driven by the creation of large datasets, which require significant human effort. This has spurred interest in semi-supervised learning (SSL) that leverages both labeled and unlabeled data. However, the absence of hypotheses and class labels in NLI tasks complicates SSL. Prior work has used class-specific fine-tuned large language models (LLMs) to generate hypotheses and assign pseudo-labels but discarded many LLM-constructed samples during training to ensure the quality. In contrast, we propose to leverage all LLM-constructed samples by handling potentially noisy samples by injecting the moments of labeled samples during training to properly adjust the level of noise. Our method outperforms strong baselines on multiple NLI datasets in low-resource settings.
Anthology ID:
2025.naacl-short.54
Volume:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
641–648
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.54/
DOI:
Bibkey:
Cite (ACL):
Seo Yeon Park. 2025. Leveraging Moment Injection for Enhanced Semi-supervised Natural Language Inference with Large Language Models. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 641–648, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Leveraging Moment Injection for Enhanced Semi-supervised Natural Language Inference with Large Language Models (Park, NAACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.54.pdf