Abstract
Automatic personalized corrective feedback can help language learners from different backgrounds better acquire a new language. This paper introduces a learner English dataset in which learner errors are accompanied by information about possible error sources. This dataset contains manually annotated error causes for learner writing errors. These causes tie learner mistakes to structures from their first languages, when the rules in English and in the first language diverge. This new dataset will enable second language acquisition researchers to computationally analyze a large quantity of learner errors that are related to language transfer from the learners’ first language. The dataset can also be applied in personalizing grammatical error correction systems according to the learners’ first language and in providing feedback that is informed by the cause of an error.- Anthology ID:
- 2021.naacl-main.251
- Volume:
- Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
- Month:
- June
- Year:
- 2021
- Address:
- Online
- Venue:
- NAACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 3129–3142
- Language:
- URL:
- https://aclanthology.org/2021.naacl-main.251
- DOI:
- 10.18653/v1/2021.naacl-main.251
- Cite (ACL):
- Leticia Farias Wanderley, Nicole Zhao, and Carrie Demmans Epp. 2021. Negative language transfer in learner English: A new dataset. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3129–3142, Online. Association for Computational Linguistics.
- Cite (Informal):
- Negative language transfer in learner English: A new dataset (Farias Wanderley et al., NAACL 2021)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/2021.naacl-main.251.pdf
- Code
- EdTeKLA/LanguageTransfer
- Data
- FCE, Penn Treebank