A New Error Annotation for Dyslexic texts in Arabic

Maha Alamri, William J Teahan


Abstract
This paper aims to develop a new classification of errors made in Arabic by those suffering from dyslexia to be used in the annotation of the Arabic dyslexia corpus (BDAC). The dyslexic error classification for Arabic texts (DECA) comprises a list of spelling errors extracted from previous studies and a collection of texts written by people with dyslexia that can provide a framework to help analyse specific errors committed by dyslexic writers. The classification comprises 37 types of errors, grouped into nine categories. The paper also discusses building a corpus of dyslexic Arabic texts that uses the error annotation scheme and provides an analysis of the errors that were found in the texts.
Anthology ID:
W17-1309
Volume:
Proceedings of the Third Arabic Natural Language Processing Workshop
Month:
April
Year:
2017
Address:
Valencia, Spain
Venue:
WANLP
SIG:
SEMITIC
Publisher:
Association for Computational Linguistics
Note:
Pages:
72–78
Language:
URL:
https://aclanthology.org/W17-1309
DOI:
10.18653/v1/W17-1309
Bibkey:
Cite (ACL):
Maha Alamri and William J Teahan. 2017. A New Error Annotation for Dyslexic texts in Arabic. In Proceedings of the Third Arabic Natural Language Processing Workshop, pages 72–78, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
A New Error Annotation for Dyslexic texts in Arabic (Alamri & Teahan, WANLP 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/nodalida-main-page/W17-1309.pdf