Abstract
Probabilistic approaches have proven themselves well in learning phonological structure. In contrast, theoretical linguistics usually works with deterministic generalizations. The goal of this paper is to explore possible interactions between information-theoretic methods and deterministic linguistic knowledge and to examine some ways in which both can be used in tandem to extract phonological and morphophonological patterns from a small annotated dataset. Local and nonlocal processes in Mishar Tatar (Turkic/Kipchak) are examined as a case study.- Anthology ID:
- W18-5819
- Volume:
- Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology
- Month:
- October
- Year:
- 2018
- Address:
- Brussels, Belgium
- Venue:
- EMNLP
- SIG:
- SIGMORPHON
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 167–175
- Language:
- URL:
- https://aclanthology.org/W18-5819
- DOI:
- 10.18653/v1/W18-5819
- Cite (ACL):
- Marina Ermolaeva. 2018. Extracting Morphophonology from Small Corpora. In Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 167–175, Brussels, Belgium. Association for Computational Linguistics.
- Cite (Informal):
- Extracting Morphophonology from Small Corpora (Ermolaeva, EMNLP 2018)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/W18-5819.pdf