Lilla Magyar

2019

What do phone embeddings learn about Phonology?
Sudheer Kolachina | Lilla Magyar
Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology

Recent work has evaluated phone embeddings using sound analogies and correlations between distinctive feature space and embedding space. It has remained unclear which aspects of natural language phonology are learnt by neural-network-inspired distributed representational models such as word2vec. To study the kinds of phonological relationships learnt by phone embeddings, we present artificial phonology experiments showing that phone embeddings learn paradigmatic relationships such as phonemic and allophonic distribution quite well. They also capture co-occurrence restrictions among vowels, such as those observed in languages with vowel harmony. However, they are unable to learn co-occurrence restrictions among consonants.

Co-authors

Sudheer Kolachina (1)

Venues

ACL (1)