Abstract
This paper aims to develop a new classification of errors made in Arabic by those suffering from dyslexia to be used in the annotation of the Arabic dyslexia corpus (BDAC). The dyslexic error classification for Arabic texts (DECA) comprises a list of spelling errors extracted from previous studies and a collection of texts written by people with dyslexia that can provide a framework to help analyse specific errors committed by dyslexic writers. The classification comprises 37 types of errors, grouped into nine categories. The paper also discusses building a corpus of dyslexic Arabic texts that uses the error annotation scheme and provides an analysis of the errors that were found in the texts.- Anthology ID:
- W17-1309
- Volume:
- Proceedings of the Third Arabic Natural Language Processing Workshop
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Venue:
- WANLP
- SIG:
- SEMITIC
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 72–78
- Language:
- URL:
- https://aclanthology.org/W17-1309
- DOI:
- 10.18653/v1/W17-1309
- Cite (ACL):
- Maha Alamri and William J Teahan. 2017. A New Error Annotation for Dyslexic texts in Arabic. In Proceedings of the Third Arabic Natural Language Processing Workshop, pages 72–78, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- A New Error Annotation for Dyslexic texts in Arabic (Alamri & Teahan, WANLP 2017)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/W17-1309.pdf