Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator
Nizar Habash, Reham Marzouk, Christian Khairallah, Salam Khalifa
Abstract
Arabic is a morphologically rich and complex language, with numerous dialectal variants. Previous efforts on Arabic morphology modeling focused on specific variants and specific domains using a range of techniques with different degrees of linguistic modeling transparency. In this paper we propose a new approach to modeling Arabic morphology with an eye towards multi-dialectness, resource openness, and easy extensibility and use. We demonstrate our approach by modeling verbs from Standard Arabic and Egyptian Arabic, within a common framework, and with high coverage.- Anthology ID:
- 2022.sigmorphon-1.10
- Volume:
- Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
- Month:
- July
- Year:
- 2022
- Address:
- Seattle, Washington
- Editors:
- Garrett Nicolai, Eleanor Chodroff
- Venue:
- SIGMORPHON
- SIG:
- SIGMORPHON
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 92–102
- Language:
- URL:
- https://aclanthology.org/2022.sigmorphon-1.10
- DOI:
- 10.18653/v1/2022.sigmorphon-1.10
- Cite (ACL):
- Nizar Habash, Reham Marzouk, Christian Khairallah, and Salam Khalifa. 2022. Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator. In Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 92–102, Seattle, Washington. Association for Computational Linguistics.
- Cite (Informal):
- Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator (Habash et al., SIGMORPHON 2022)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/2022.sigmorphon-1.10.pdf
- Code
- CAMeL-Lab/camel_morph