Abstract
This paper presents an attempt at multiword expressions (MWEs) discovery in the Persian language. It focuses on extracting MWEs containing lemmas of a particular group: loanwords in Persian and their equivalents proposed by the Academy of Persian Language and Literature. In order to discover such MWEs, four association measures (AMs) are used and evaluated. Finally, the list of extracted MWEs is analyzed, and a comparison between expressions with loanwords and equivalents is presented. To our knowledge, this is the first time such analysis was provided for the Persian language.- Anthology ID:
- 2021.ranlp-1.105
- Volume:
- Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
- Month:
- September
- Year:
- 2021
- Address:
- Held Online
- Editors:
- Ruslan Mitkov, Galia Angelova
- Venue:
- RANLP
- SIG:
- Publisher:
- INCOMA Ltd.
- Note:
- Pages:
- 918–928
- Language:
- URL:
- https://aclanthology.org/2021.ranlp-1.105
- DOI:
- Cite (ACL):
- Katarzyna Marszałek-Kowalewska. 2021. Discovery of Multiword Expressions with Loanwords and Their Equivalents in the Persian Language. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 918–928, Held Online. INCOMA Ltd..
- Cite (Informal):
- Discovery of Multiword Expressions with Loanwords and Their Equivalents in the Persian Language (Marszałek-Kowalewska, RANLP 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2021.ranlp-1.105.pdf