A Syntactically Expressive Morphological Analyzer for Turkish

Adnan Ozturel, Tolga Kayadelen, Isin Demirsahin


Abstract
We present a broad-coverage model of Turkish morphology and an open-source morphological analyzer that implements it. The model captures intricacies of the Turkish morphology-syntax interface and could thus serve as a baseline to guide language model development. It introduces a novel fine part-of-speech tagset and a fine-grained affix inventory, and it represents morphotactics without zero-derivations. The morphological analyzer is freely available. It consists of modular, reusable components built on human-annotated gold-standard lexicons, and it implements Turkish morphotactics as finite-state transducers using OpenFst and morphophonemic processes as Thrax grammars.
Anthology ID:
W19-3110
Volume:
Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing
Month:
September
Year:
2019
Address:
Dresden, Germany
Venue:
FSMNLP
SIG:
SIGFSM
Publisher:
Association for Computational Linguistics
Pages:
65–75
URL:
https://aclanthology.org/W19-3110
DOI:
10.18653/v1/W19-3110
Cite (ACL):
Adnan Ozturel, Tolga Kayadelen, and Isin Demirsahin. 2019. A Syntactically Expressive Morphological Analyzer for Turkish. In Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing, pages 65–75, Dresden, Germany. Association for Computational Linguistics.
Cite (Informal):
A Syntactically Expressive Morphological Analyzer for Turkish (Ozturel et al., FSMNLP 2019)
PDF:
https://preview.aclanthology.org/remove-xml-comments/W19-3110.pdf