Abstract
Fine-tuning pre-trained language models for downstream tasks has become a norm for NLP. Recently it is found that intermediate training can improve performance for fine-tuning language models for target tasks, high-level inference tasks such as Question Answering (QA) tend to work best as intermediate tasks. However it is not clear if intermediate training generally benefits various language models. In this paper, using the SQuAD-2.0 QA task for intermediate training for target text classification tasks, we experimented on eight tasks for single-sequence classification and eight tasks for sequence-pair classification using two base and two compact language models. Our experiments show that QA-based intermediate training generates varying transfer performance across different language models, except for similar QA tasks.- Anthology ID:
- 2021.alta-1.16
- Volume:
- Proceedings of the The 19th Annual Workshop of the Australasian Language Technology Association
- Month:
- December
- Year:
- 2021
- Address:
- Online
- Venue:
- ALTA
- SIG:
- Publisher:
- Australasian Language Technology Association
- Note:
- Pages:
- 158–162
- Language:
- URL:
- https://aclanthology.org/2021.alta-1.16
- DOI:
- Cite (ACL):
- Shiwei Zhang and Xiuzhen Zhang. 2021. Does QA-based intermediate training help fine-tuning language models for text classification?. In Proceedings of the The 19th Annual Workshop of the Australasian Language Technology Association, pages 158–162, Online. Australasian Language Technology Association.
- Cite (Informal):
- Does QA-based intermediate training help fine-tuning language models for text classification? (Zhang & Zhang, ALTA 2021)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2021.alta-1.16.pdf
- Data
- AG News, GLUE, LIAR, QNLI, SuperGLUE, TweetEval, WikiQA