Abstract
The research on machine learning of morphology often involves formulating morphological descriptions directly on surface forms of words. As the established two-level morphology paradigm requires the knowledge of the underlying structure, it is not widely used in such settings. In this paper, we propose a formalism describing structural relationships between words based on theories of morphology that reject the notions of internal word structure and morpheme. The formalism covers a wide variety of morphological phenomena (including non-concatenative ones like stem vowel alternation) without the need of workarounds and extensions. Furthermore, we show that morphological rules formulated in such way can be easily translated to FSTs, which enables us to derive performant approaches to morphological analysis, generation and automatic rule discovery.- Anthology ID:
- W19-3107
- Volume:
- Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing
- Month:
- September
- Year:
- 2019
- Address:
- Dresden, Germany
- Venue:
- FSMNLP
- SIG:
- SIGFSM
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 37–45
- Language:
- URL:
- https://aclanthology.org/W19-3107
- DOI:
- 10.18653/v1/W19-3107
- Cite (ACL):
- Maciej Janicki. 2019. Finite State Transducer Calculus for Whole Word Morphology. In Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing, pages 37–45, Dresden, Germany. Association for Computational Linguistics.
- Cite (Informal):
- Finite State Transducer Calculus for Whole Word Morphology (Janicki, FSMNLP 2019)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W19-3107.pdf