Abstract
We offer one explanation for the historically low performance of French in the SIGMORPHON-UniMorph shared tasks. We conducted experiments replicating the 2023 task on French with the non-neural and neural baselines, first using the original task splits, and then using splits that excluded Old and Middle French lemmas. We applied a taxonomy on our errors using a framework based on Kyle Gorman’s “Weird Inflects but OK” 2019 annotation scheme, finding that a high portion of the French errors produced with the original splits were due to the inclusion of Old French forms, which was resolved with cleaned data.- Anthology ID:
- 2024.sigmorphon-1.5
- Volume:
- Proceedings of the 21st SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology
- Month:
- June
- Year:
- 2024
- Address:
- Mexico City, Mexico
- Editors:
- Garrett Nicolai, Eleanor Chodroff, Frederic Mailhot, Çağrı Çöltekin
- Venue:
- SIGMORPHON
- SIG:
- SIGMORPHON
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 39–50
- Language:
- URL:
- https://aclanthology.org/2024.sigmorphon-1.5
- DOI:
- Cite (ACL):
- William Kezerian and Kristine Yu. 2024. Ye Olde French: Effect of Old and Middle French on SIGMORPHON-UniMorph Shared Task Data. In Proceedings of the 21st SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 39–50, Mexico City, Mexico. Association for Computational Linguistics.
- Cite (Informal):
- Ye Olde French: Effect of Old and Middle French on SIGMORPHON-UniMorph Shared Task Data (Kezerian & Yu, SIGMORPHON 2024)
- PDF:
- https://preview.aclanthology.org/fix-volume-bibkeys/2024.sigmorphon-1.5.pdf