Discovery of Multiword Expressions with Loanwords and Their Equivalents in the Persian Language

Katarzyna Marszałek-Kowalewska


Abstract
This paper presents an attempt at multiword expressions (MWEs) discovery in the Persian language. It focuses on extracting MWEs containing lemmas of a particular group: loanwords in Persian and their equivalents proposed by the Academy of Persian Language and Literature. In order to discover such MWEs, four association measures (AMs) are used and evaluated. Finally, the list of extracted MWEs is analyzed, and a comparison between expressions with loanwords and equivalents is presented. To our knowledge, this is the first time such analysis was provided for the Persian language.
Anthology ID:
2021.ranlp-1.105
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
Month:
September
Year:
2021
Address:
Held Online
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
918–928
Language:
URL:
https://aclanthology.org/2021.ranlp-1.105
DOI:
Bibkey:
Cite (ACL):
Katarzyna Marszałek-Kowalewska. 2021. Discovery of Multiword Expressions with Loanwords and Their Equivalents in the Persian Language. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 918–928, Held Online. INCOMA Ltd..
Cite (Informal):
Discovery of Multiword Expressions with Loanwords and Their Equivalents in the Persian Language (Marszałek-Kowalewska, RANLP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-2/2021.ranlp-1.105.pdf