Abstract
We propose a sequence labelling approach to word-level morpheme segmentation. Segmentation labels are edit operations derived from a modified minimum edit distance alignment. We show that sequence labelling performs well for “shallow segmentation” and “canonical segmentation”, achieving 96.06 f1 score (macroaveraged over all languages in the shared task) and ranking 3rd among all participating teams. Therefore, we conclude that sequence labelling is a promising approach to morpheme segmentation.- Anthology ID:
- 2022.sigmorphon-1.13
- Volume:
- Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
- Month:
- July
- Year:
- 2022
- Address:
- Seattle, Washington
- Venue:
- SIGMORPHON
- SIG:
- SIGMORPHON
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 124–130
- Language:
- URL:
- https://aclanthology.org/2022.sigmorphon-1.13
- DOI:
- 10.18653/v1/2022.sigmorphon-1.13
- Cite (ACL):
- Leander Girrbach. 2022. SIGMORPHON 2022 Shared Task on Morpheme Segmentation Submission Description: Sequence Labelling for Word-Level Morpheme Segmentation. In Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 124–130, Seattle, Washington. Association for Computational Linguistics.
- Cite (Informal):
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation Submission Description: Sequence Labelling for Word-Level Morpheme Segmentation (Girrbach, SIGMORPHON 2022)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2022.sigmorphon-1.13.pdf