Abstract
We use sequence-to-sequence networks trained on sequential phonetic encoding tasks to construct compositional phonological representations of words. We show that the output of an encoder network can predict the phonetic durations of American English words better than a number of alternative forms. We also show that the model’s learned representations map onto existing measures of words’ phonological structure (phonological neighborhood density and phonotactic probability).

- Anthology ID:
- W19-4224
- Volume:
- Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Venue:
- ACL
- SIG:
- SIGMORPHON
- Publisher:
- Association for Computational Linguistics
- Pages:
- 206–217
- URL:
- https://aclanthology.org/W19-4224
- DOI:
- 10.18653/v1/W19-4224
- Cite (ACL):
- Cassandra L. Jacobs and Fred Mailhot. 2019. Encoder-decoder models for latent phonological representations of words. In Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 206–217, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Encoder-decoder models for latent phonological representations of words (Jacobs & Mailhot, ACL 2019)
- PDF:
- https://preview.aclanthology.org/paclic-22-ingestion/W19-4224.pdf