Analysis of English Spelling Errors in a Word-Typing Game

Ryuichi Tachibana, Mamoru Komachi


Abstract
The emergence of the web has necessitated the need to detect and correct noisy consumer-generated texts. Most of the previous studies on English spelling-error extraction collected English spelling errors from web services such as Twitter by using the edit distance or from input logs utilizing crowdsourcing. However, in the former approach, it is not clear which word corresponds to the spelling error, and the latter approach requires an annotation cost for the crowdsourcing. One notable exception is Rodrigues and Rytting (2012), who proposed to extract English spelling errors by using a word-typing game. Their approach saves the cost of crowdsourcing, and guarantees an exact alignment between the word and the spelling error. However, they did not assert whether the extracted spelling error corpora reflect the usual writing process such as writing a document. Therefore, we propose a new correctable word-typing game that is more similar to the actual writing process. Experimental results showed that we can regard typing-game logs as a source of spelling errors.
Anthology ID:
L16-1060
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
385–390
Language:
URL:
https://aclanthology.org/L16-1060
DOI:
Bibkey:
Cite (ACL):
Ryuichi Tachibana and Mamoru Komachi. 2016. Analysis of English Spelling Errors in a Word-Typing Game. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 385–390, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Analysis of English Spelling Errors in a Word-Typing Game (Tachibana & Komachi, LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/update-css-js/L16-1060.pdf