Bernardo Magnini
2026
Thesis Proposal: LLMs post-training for multilingual medical tasks. Instruction-Tuning, Continual-Pretraining or Reasoning?
Pietro Ferrazzi | Alberto Lavelli | Bernardo Magnini
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Adapting Large Language Models to the medical domain remains an active area of research, with multiple strategies proposed to leverage annotated and unannotated data effectively. In this work, we propose a thesis outline to compare three common adaptation approaches: Instruction Tuning, Continual Pretraining, and Reasoning-oriented Training. We identify 5 dimensions to analyse: i) the interaction between the adaptation technique and the tasks; ii) the impact of the data size on the downstream performance; iii) the differences between datasets required by the three techniques; iv) the impact of the techniques given the model size; v) the impact of the techniques given the language. We construct an evaluation framework composed of 5 multilingual medical NLP tasks (named entity recognition, relation extraction, question answering, case report form filling, argument mining), spanning 21 datasets in English, Italian, and Spanish, for a total of 61 combinations of language and sub-task.