Abstract
We present a broad coverage model of Turkish morphology and an open-source morphological analyzer that implements it. The model captures intricacies of Turkish morphology-syntax interface, thus could be used as a baseline that guides language model development. It introduces a novel fine part-of-speech tagset, a fine-grained affix inventory and represents morphotactics without zero-derivations. The morphological analyzer is freely available. It consists of modular reusable components of human-annotated gold standard lexicons, implements Turkish morphotactics as finite-state transducers using OpenFst and morphophonemic processes as Thrax grammars.- Anthology ID:
- W19-3110
- Volume:
- Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing
- Month:
- September
- Year:
- 2019
- Address:
- Dresden, Germany
- Venue:
- FSMNLP
- SIG:
- SIGFSM
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 65–75
- Language:
- URL:
- https://aclanthology.org/W19-3110
- DOI:
- 10.18653/v1/W19-3110
- Cite (ACL):
- Adnan Ozturel, Tolga Kayadelen, and Isin Demirsahin. 2019. A Syntactically Expressive Morphological Analyzer for Turkish. In Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing, pages 65–75, Dresden, Germany. Association for Computational Linguistics.
- Cite (Informal):
- A Syntactically Expressive Morphological Analyzer for Turkish (Ozturel et al., FSMNLP 2019)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/W19-3110.pdf