Effect of Multilingual and Domain-adapted Continual Pre-training on Few-shot Promptability

Ken Yano, Makoto Miwa


Abstract
Continual Pre-training (CPT) can help pre-trained large language models (LLMs) effectively adapt to new or under-trained domains or low-resource languages without re-training from scratch.Nevertheless, during CPT, the model’s few-shot transfer ability is known to be affected for emergent tasks.We verified this by comparing the performance between the few-shot and fine-tuning settings on the same tasks.We used Llama3-ELAINE-medLLM, which was continually pre-trained on Llama3-8B, targeted for the biomedical domain, and adapted for multilingual languages (English, Japanese, and Chinese).We compared the performance of Llama3-ELAINE-medLLM and Llama3-8B in three emergent tasks: two related domain tasks, entity recognition (NER) and machine translation (MT), and one out-of-domain task, summarization (SUM). Our experimental results show that degradation in few-shot transfer ability does not necessarily indicate the model’s underlying potential on the same task after fine-tuning.
Anthology ID:
2025.bionlp-1.2
Volume:
ACL 2025
Month:
August
Year:
2025
Address:
Viena, Austria
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Junichi Tsujii
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
18–26
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.2/
DOI:
Bibkey:
Cite (ACL):
Ken Yano and Makoto Miwa. 2025. Effect of Multilingual and Domain-adapted Continual Pre-training on Few-shot Promptability. In ACL 2025, pages 18–26, Viena, Austria. Association for Computational Linguistics.
Cite (Informal):
Effect of Multilingual and Domain-adapted Continual Pre-training on Few-shot Promptability (Yano & Miwa, BioNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.2.pdf
Supplementarymaterial:
 2025.bionlp-1.2.SupplementaryMaterial.zip
Supplementarymaterial:
 2025.bionlp-1.2.SupplementaryMaterial.txt