When natural language is not enough: The limits of in-context learning demonstrations in multilingual reasoning

Leonardo Ranaldi, Barry Haddow, Alexandra Birch


Abstract
Previous studies have demonstrated the effectiveness of reasoning methods in eliciting multi-step reasoned answers from Large Language Models (LLMs) by leveraging in-context demonstrations. These methods, exemplified by Chain-of-Thought (CoT) and Program-Aided Language Models (PAL), have been shown to perform well in monolingual contexts, primarily in English. There has, however, been limited exploration of their abilities in other languages. To gain a deeper understanding of the role of reasoning methods for in-context demonstrations, we investigate how well CoT and PAL perform across languages for arithmetic and symbolic reasoning tasks. Our findings indicate that the effectiveness of reasoning methods varies significantly across languages and models. Specifically, CoT, which relies on natural language demonstrations, tends to be more accurate in high-resource than in low-resource languages. Conversely, the structured nature of PAL demonstrations facilitates multilingual comprehension, enabling LLMs to generate programmatic answers in both high- and low-resource languages and leading to significantly more accurate responses than CoT.
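To make the contrast between the two demonstration styles concrete, here is a minimal sketch, assuming a toy arithmetic question posed in German. The question, both rationales, and the helper names `run_pal` and `parse_cot` are hypothetical illustrations; they are not taken from the paper's prompts, data, or code.

```python
import re

# Hypothetical question in German (illustrative only, not from the paper):
# "Anna has 5 apples and buys 3 bags of 4 apples each. How many does she have now?"
QUESTION_DE = (
    "Anna hat 5 Äpfel und kauft 3 Tüten mit je 4 Äpfeln. "
    "Wie viele Äpfel hat sie jetzt?"
)

# CoT-style rationale: free-form natural language; the answer must be parsed out.
COT_RATIONALE = (
    "Anna starts with 5 apples. 3 bags of 4 apples are 12 apples. "
    "5 + 12 = 17. The answer is 17."
)

# PAL-style rationale: a short program; the answer comes from running it.
PAL_RATIONALE = """
def solution():
    apples_initial = 5
    bags = 3
    apples_per_bag = 4
    return apples_initial + bags * apples_per_bag
"""


def parse_cot(rationale: str) -> int:
    """Extract the final integer from a 'The answer is N' pattern."""
    match = re.search(r"The answer is (-?\d+)", rationale)
    if match is None:
        raise ValueError("no final answer found in rationale")
    return int(match.group(1))


def run_pal(program: str) -> int:
    """Execute the generated program and call its solution() function.

    exec() on model output is unsafe outside a sandbox; this is a toy example.
    """
    namespace: dict = {}
    exec(program, namespace)
    return namespace["solution"]()


cot_answer = parse_cot(COT_RATIONALE)
pal_answer = run_pal(PAL_RATIONALE)
```

In the programmatic style, the final value is computed by the interpreter rather than read off generated text, so the answer does not depend on how the natural-language rationale is worded.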
Anthology ID:
2025.findings-naacl.412
Volume:
Findings of the Association for Computational Linguistics: NAACL 2025
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
7369–7396
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.findings-naacl.412/
Cite (ACL):
Leonardo Ranaldi, Barry Haddow, and Alexandra Birch. 2025. When natural language is not enough: The limits of in-context learning demonstrations in multilingual reasoning. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 7369–7396, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
When natural language is not enough: The limits of in-context learning demonstrations in multilingual reasoning (Ranaldi et al., Findings 2025)
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.findings-naacl.412.pdf