Abstract
We present CALIMAGLF, a Gulf Arabic morphological analyzer currently covering over 2,600 verbal lemmas. We describe in detail the process of building the analyzer starting from phonetic dictionary entries to fully inflected orthographic paradigms and associated lexicon and orthographic variants. We evaluate the coverage of CALIMA-GLF against Modern Standard Arabic and Egyptian Arabic analyzers on part of a Gulf Arabic novel. CALIMA-GLF verb analysis token recall for identifying correct POS tag outperforms both the Modern Standard Arabic and Egyptian Arabic analyzers by over 27.4% and 16.9% absolute, respectively.- Anthology ID:
- W17-1305
- Volume:
- Proceedings of the Third Arabic Natural Language Processing Workshop
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Venue:
- WANLP
- SIG:
- SEMITIC
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 35–45
- Language:
- URL:
- https://aclanthology.org/W17-1305
- DOI:
- 10.18653/v1/W17-1305
- Cite (ACL):
- Salam Khalifa, Sara Hassan, and Nizar Habash. 2017. A Morphological Analyzer for Gulf Arabic Verbs. In Proceedings of the Third Arabic Natural Language Processing Workshop, pages 35–45, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- A Morphological Analyzer for Gulf Arabic Verbs (Khalifa et al., WANLP 2017)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/W17-1305.pdf
- Data
- Gumar Corpus