PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning
Jaeseong Lee, Seung-won Hwang, Hojin Lee, Yunju Bak, Changmin Lee
Abstract
Large language models (LLMs) have become standard for natural language generation tasks, with instruction-tuning enhancing their capabilities. However, the lack of instruction-tuning datasets in languages other than English limits their application to diverse languages. To address this, researchers have adapted English-centric LLMs to other languages by appending English tuning data with its translated pair, from which we observe negative interference between the two. To resolve this, our contribution is identifying English as an internal pivot language, based on which we disentangle the roles of English and target language data in training. Specifically, we first design two roles as pivoted objectives, and also propose to regulate between the two, to better generalize for under-represented languages. Experiments across various languages demonstrate the effectiveness of our approach on multiple benchmarks. The code is publicly available for further exploration.- Anthology ID:
- 2025.naacl-short.19
- Volume:
- Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
- Month:
- April
- Year:
- 2025
- Address:
- Albuquerque, New Mexico
- Editors:
- Luis Chiruzzo, Alan Ritter, Lu Wang
- Venue:
- NAACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 222–228
- Language:
- URL:
- https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.19/
- DOI:
- Cite (ACL):
- Jaeseong Lee, Seung-won Hwang, Hojin Lee, Yunju Bak, and Changmin Lee. 2025. PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 222–228, Albuquerque, New Mexico. Association for Computational Linguistics.
- Cite (Informal):
- PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning (Lee et al., NAACL 2025)
- PDF:
- https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.19.pdf