Extracting Morphophonology from Small Corpora

Marina Ermolaeva


Abstract
Probabilistic approaches have proven themselves well in learning phonological structure. In contrast, theoretical linguistics usually works with deterministic generalizations. The goal of this paper is to explore possible interactions between information-theoretic methods and deterministic linguistic knowledge and to examine some ways in which both can be used in tandem to extract phonological and morphophonological patterns from a small annotated dataset. Local and nonlocal processes in Mishar Tatar (Turkic/Kipchak) are examined as a case study.
Anthology ID:
W18-5819
Volume:
Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
October
Year:
2018
Address:
Brussels, Belgium
Editors:
Sandra Kuebler, Garrett Nicolai
Venue:
EMNLP
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Note:
Pages:
167–175
Language:
URL:
https://aclanthology.org/W18-5819
DOI:
10.18653/v1/W18-5819
Bibkey:
Cite (ACL):
Marina Ermolaeva. 2018. Extracting Morphophonology from Small Corpora. In Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 167–175, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Extracting Morphophonology from Small Corpora (Ermolaeva, EMNLP 2018)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/W18-5819.pdf