Abstract
The identification of cognates and derivatives is a fundamental process in historical linguistics, on which any further research is based. In this paper we present our contribution to the SIGTYP 2023 Shared Task on cognate and derivative detection. We propose a multi-lingual solution based on features extracted from the alignment of the orthographic and phonetic representations of the words.- Anthology ID:
- 2023.sigtyp-1.15
- Volume:
- Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
- Month:
- May
- Year:
- 2023
- Address:
- Dubrovnik, Croatia
- Editors:
- Lisa Beinborn, Koustava Goswami, Saliha Muradoğlu, Alexey Sorokin, Ritesh Kumar, Andreas Shcherbakov, Edoardo M. Ponti, Ryan Cotterell, Ekaterina Vylomova
- Venue:
- SIGTYP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 137–142
- Language:
- URL:
- https://aclanthology.org/2023.sigtyp-1.15
- DOI:
- 10.18653/v1/2023.sigtyp-1.15
- Cite (ACL):
- Liviu P. Dinu, Ioan-Bogdan Iordache, and Ana Sabina Uban. 2023. CoToHiLi at SIGTYP 2023: Ensemble Models for Cognate and Derivative Words Detection. In Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, pages 137–142, Dubrovnik, Croatia. Association for Computational Linguistics.
- Cite (Informal):
- CoToHiLi at SIGTYP 2023: Ensemble Models for Cognate and Derivative Words Detection (Dinu et al., SIGTYP 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-1/2023.sigtyp-1.15.pdf