Finite State Transducer Calculus for Whole Word Morphology

Maciej Janicki


Abstract
The research on machine learning of morphology often involves formulating morphological descriptions directly on surface forms of words. As the established two-level morphology paradigm requires the knowledge of the underlying structure, it is not widely used in such settings. In this paper, we propose a formalism describing structural relationships between words based on theories of morphology that reject the notions of internal word structure and morpheme. The formalism covers a wide variety of morphological phenomena (including non-concatenative ones like stem vowel alternation) without the need of workarounds and extensions. Furthermore, we show that morphological rules formulated in such way can be easily translated to FSTs, which enables us to derive performant approaches to morphological analysis, generation and automatic rule discovery.
Anthology ID:
W19-3107
Volume:
Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing
Month:
September
Year:
2019
Address:
Dresden, Germany
Venue:
FSMNLP
SIG:
SIGFSM
Publisher:
Association for Computational Linguistics
Note:
Pages:
37–45
Language:
URL:
https://aclanthology.org/W19-3107
DOI:
10.18653/v1/W19-3107
Bibkey:
Cite (ACL):
Maciej Janicki. 2019. Finite State Transducer Calculus for Whole Word Morphology. In Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing, pages 37–45, Dresden, Germany. Association for Computational Linguistics.
Cite (Informal):
Finite State Transducer Calculus for Whole Word Morphology (Janicki, FSMNLP 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/W19-3107.pdf