SubmissionNumber#=%=#18 FinalPaperTitle#=%=#DIALECT-COPA: Extending the Standard Translations of the COPA Causal Commonsense Reasoning Dataset to South Slavic Dialects ShortPaperTitle#=%=# NumberOfPages#=%=#10 CopyrightSigned#=%=# JobTitle#==# Organization#==# Abstract#==#The paper presents new causal commonsense reasoning datasets for South Slavic dialects, based on the Choice of Plausible Alternatives (COPA) dataset. The dialectal datasets are built by translating by native dialect speakers from the English original and the corresponding standard translation. Three dialects are covered -- the Cerkno dialect of Slovenian, the Chakavian dialect of Croatian and the Torlak dialect of Serbian. The datasets are the first resource for evaluation of large language models on South Slavic dialects, as well as among the first commonsense reasoning datasets on dialects overall. The paper describes specific challenges met during the translation process. A comparison of the dialectal datasets with their standard language counterparts shows a varying level of character-level, word-level and lexicon-level deviation of dialectal text from the standard datasets. The observed differences are well reproduced in initial zero-shot and 10-shot experiments, where the Slovenian Cerkno dialect and the Croatian Chakavian dialect show significantly lower results than the Torlak dialect. These results show also for the dialectal datasets to be significantly more challenging than the standard datasets. Finally, in-context learning on just 10 examples shows to improve the results dramatically, especially for the dialects with the lowest results. Author{1}{Firstname}#=%=#Nikola Author{1}{Lastname}#=%=#Ljubešić Author{1}{Username}#=%=#nljubesi Author{1}{Email}#=%=#nikola.ljubesic@ijs.si Author{1}{Affiliation}#=%=#Jožef Stefan Institute Author{2}{Firstname}#=%=#Nada Author{2}{Lastname}#=%=#Galant Author{2}{Email}#=%=#nada.galant@gmail.com Author{2}{Affiliation}#=%=#Čakavski sabor Author{3}{Firstname}#=%=#Sonja Author{3}{Lastname}#=%=#Benčina Author{3}{Email}#=%=#be.sonja@gmail.com Author{3}{Affiliation}#=%=#Parafraza Author{4}{Firstname}#=%=#Jaka Author{4}{Lastname}#=%=#Čibej Author{4}{Username}#=%=#jaka_cibej Author{4}{Email}#=%=#jaka.cibej@ff.uni-lj.si Author{4}{Affiliation}#=%=#University of Ljubljana Author{5}{Firstname}#=%=#Stefan Author{5}{Lastname}#=%=#Milosavljević Author{5}{Email}#=%=#stefannmilosavljevic@gmail.com Author{5}{Affiliation}#=%=#Karl-Franzens-Universität Graz Author{6}{Firstname}#=%=#Peter Author{6}{Lastname}#=%=#Rupnik Author{6}{Username}#=%=#boutrosboutrosrupnik Author{6}{Email}#=%=#peter.rupnik@ijs.si Author{6}{Affiliation}#=%=#Jožef Stefan Institute Author{7}{Firstname}#=%=#Taja Author{7}{Lastname}#=%=#Kuzman Author{7}{Username}#=%=#tajakuz Author{7}{Email}#=%=#taja.kuzman@ijs.si Author{7}{Affiliation}#=%=#Jožef Stefan Institute ========== èéáğö