Abstract
This work describes the Edinburgh submission to the SIGMORPHON 2021 Shared Task 2 on unsupervised morphological paradigm clustering. Given raw text input, the task was to assign each token to a cluster with other tokens from the same paradigm. We use Adaptor Grammar segmentations combined with frequency-based heuristics to predict paradigm clusters. Our system achieved the highest average F1 score across 9 test languages, placing first out of 15 submissions.- Anthology ID:
- 2021.sigmorphon-1.9
- Volume:
- Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
- Month:
- August
- Year:
- 2021
- Address:
- Online
- Editors:
- Garrett Nicolai, Kyle Gorman, Ryan Cotterell
- Venue:
- SIGMORPHON
- SIG:
- SIGMORPHON
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 82–89
- Language:
- URL:
- https://aclanthology.org/2021.sigmorphon-1.9
- DOI:
- 10.18653/v1/2021.sigmorphon-1.9
- Cite (ACL):
- Kate McCurdy, Sharon Goldwater, and Adam Lopez. 2021. Adaptor Grammars for Unsupervised Paradigm Clustering. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 82–89, Online. Association for Computational Linguistics.
- Cite (Informal):
- Adaptor Grammars for Unsupervised Paradigm Clustering (McCurdy et al., SIGMORPHON 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/2021.sigmorphon-1.9.pdf