Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks
Trapit Bansal, Rishikesh Jha, Tsendsuren Munkhdalai, Andrew McCallum
Abstract
Self-supervised pre-training of transformer models has revolutionized NLP applications. Such pre-training with language modeling objectives provides a useful parameter initialization that generalizes well to new tasks with fine-tuning. However, fine-tuning is still data inefficient: when there are few labeled examples, accuracy can be low. Data efficiency can be improved by optimizing pre-training directly for future fine-tuning with few examples; this can be treated as a meta-learning problem. However, standard meta-learning techniques require many training tasks in order to generalize, and finding a diverse set of such supervised tasks is usually difficult. This paper proposes a self-supervised approach to generate a large, rich distribution of meta-learning tasks from unlabeled text. This is achieved with a cloze-style objective, creating separate multi-class classification tasks by gathering the tokens to be blanked from only a handful of vocabulary terms. This yields as many unique meta-training tasks as there are subsets of vocabulary terms. We meta-train a transformer model on this distribution of tasks using a recent meta-learning framework. On 17 NLP tasks, we show that this meta-training leads to better few-shot generalization than language-model pre-training followed by fine-tuning. Furthermore, we show how the self-supervised tasks can be combined with supervised tasks for meta-learning, providing substantial accuracy gains over previous supervised meta-learning.
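To make the task-construction idea in the abstract concrete, here is a minimal sketch in Python. It is not the authors' implementation (their code is linked below at iesl/metanlp): the function name make_cloze_task and its parameters are hypothetical, and whitespace tokenization stands in for the subword vocabulary a real transformer would use.

```python
import random
from collections import defaultdict

def make_cloze_task(sentences, num_classes=4, examples_per_class=8, mask_token="[MASK]"):
    """Sample one self-supervised few-shot classification task from raw text.

    Each of `num_classes` sampled vocabulary words defines a class: sentences
    containing that word have it blanked out, and the target is the task-local
    id of the removed word (labels carry no meaning across tasks).
    """
    # Index sentences by the tokens they contain (whitespace tokenization
    # here is a simplification of a real subword vocabulary).
    sentences_by_word = defaultdict(list)
    for sent in sentences:
        for tok in set(sent.split()):
            sentences_by_word[tok].append(sent)

    # Only words with enough supporting sentences can serve as class labels.
    candidates = [w for w, sents in sentences_by_word.items()
                  if len(sents) >= examples_per_class]
    class_words = random.sample(candidates, num_classes)

    task = []
    for label, word in enumerate(class_words):
        for sent in random.sample(sentences_by_word[word], examples_per_class):
            masked = " ".join(mask_token if tok == word else tok
                              for tok in sent.split())
            task.append((masked, label))
    random.shuffle(task)
    return task

# Each call draws a fresh subset of words, so the number of distinct tasks
# grows combinatorially with the vocabulary size.
corpus = (["the cat sat on the mat"] * 4 + ["a dog ran in the park"] * 4
          + ["birds fly over the lake"] * 4 + ["fish swim under the bridge"] * 4)
example_task = make_cloze_task(corpus, num_classes=2, examples_per_class=4)
print(example_task[:3])
```

Because each task is defined by a different subset of vocabulary words, this construction yields the combinatorially large task distribution the abstract highlights, with no labeled data required.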
- Anthology ID: 2020.emnlp-main.38
- Volume: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
- Month: November
- Year: 2020
- Address: Online
- Venue: EMNLP
- Publisher: Association for Computational Linguistics
- Pages: 522–534
- URL: https://aclanthology.org/2020.emnlp-main.38
- DOI: 10.18653/v1/2020.emnlp-main.38
- Cite (ACL): Trapit Bansal, Rishikesh Jha, Tsendsuren Munkhdalai, and Andrew McCallum. 2020. Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 522–534, Online. Association for Computational Linguistics.
- Cite (Informal): Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks (Bansal et al., EMNLP 2020)
- PDF: https://preview.aclanthology.org/ingestion-script-update/2020.emnlp-main.38.pdf
- Code: iesl/metanlp
- Data: GLUE