Does the Language Matter? Curriculum Learning over Neo-Latin Languages

Leonardo Ranaldi, Giulia Pucci, André Freitas


Abstract
Curriculum Learning (CL) has been emerged as an effective technique for improving the performances and reducing the cost of pre-training Large Language Models (LLMs). The efficacy of CL demonstrated in different scenarios is in the training LLMs by organizing examples from the simplest to the most complex. Although improvements have been shown extensively, this approach was used for pre-training, leaving novel fine-tuning approaches such as instruction-tuning unexplored. In this paper, we propose a novel complexity measure to empower the instruction-tuning method using the CL paradigm. To complement previous works, we propose cognitively motivated measures to determine the complexity of training demonstrations used in the instruction-tuning paradigm. Hence, we experiment with the proposed heuristics first in English and then in other languages. The downstream results show that delivering training examples by complexity ranking is also effective for instruction tuning, as it improves downstream performance while reducing costs. Furthermore, the technique can be easily transferred to languages other than English, e.g., Italian and French, without any adaptation, maintaining functionality and effectiveness.
Anthology ID:
2024.lrec-main.464
Volume:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues:
LREC | COLING
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
5212–5220
Language:
URL:
https://preview.aclanthology.org/ingest_wac_2008/2024.lrec-main.464/
DOI:
Bibkey:
Cite (ACL):
Leonardo Ranaldi, Giulia Pucci, and André Freitas. 2024. Does the Language Matter? Curriculum Learning over Neo-Latin Languages. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 5212–5220, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Does the Language Matter? Curriculum Learning over Neo-Latin Languages (Ranaldi et al., LREC-COLING 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest_wac_2008/2024.lrec-main.464.pdf