Exploring Two-Phase Continual Instruction Fine-tuning for Multilingual Adaptation in Large Language Models
Divyanshu Aggarwal, Sankarshan Damle, Navin Goyal, Satya Lokam, Sunayana Sitaram
Abstract
A key challenge for Large Language Models (LLMs) is improving their multilingual instruction-following ability over time without deteriorating their ability in languages they already excel at, typically English. In this paper, we study a two-phase Continual Fine-tuning (CFT) setup toward improving a model’s multilingual adaptability. Concretely, we consider a two-phase CFT process in which an English-only end-to-end instruction fine-tuned LLM (Phase 1) is sequentially fine-tuned on a multilingual instruction dataset (Phase 2). Across MISTRAL-7B and LLAMA-3-8B and multiple dataset pairs, we show that instructional similarity between phases is critical: aligned datasets preserve or improve English while boosting multilingual ability, whereas misaligned datasets cause English degradation. We show that this degradation arises from representation shift during CFT, and that targeted mitigation strategies, including generative replay and heuristic-based layer freezing, reduce this shift and improve multilingual adaptation.
- Anthology ID:
- 2026.findings-acl.1595
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 31882–31904
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1595/
- Cite (ACL):
- Divyanshu Aggarwal, Sankarshan Damle, Navin Goyal, Satya Lokam, and Sunayana Sitaram. 2026. Exploring Two-Phase Continual Instruction Fine-tuning for Multilingual Adaptation in Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 31882–31904, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Exploring Two-Phase Continual Instruction Fine-tuning for Multilingual Adaptation in Large Language Models (Aggarwal et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1595.pdf