Chinese Grammatical Errors Diagnosis System Based on BERT at NLPTEA-2020 CGED Shared Task

Hongying Zan, Yangchao Han, Haotian Huang, Yingjie Yan, Yuke Wang, Yingjie Han


Abstract
In the process of learning Chinese, second language learners may have various grammatical errors due to the negative transfer of native language. This paper describes our submission to the NLPTEA 2020 shared task on CGED. We present a hybrid system that utilizes both detection and correction stages. The detection stage is a sequential labelling model based on BiLSTM-CRF and BERT contextual word representation. The correction stage is a hybrid model based on the n-gram and Seq2Seq. Without adding additional features and external data, the BERT contextual word representation can effectively improve the performance metrics of Chinese grammatical error detection and correction.
Anthology ID:
2020.nlptea-1.14
Volume:
Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications
Month:
December
Year:
2020
Address:
Suzhou, China
Venue:
NLP-TEA
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
102–107
Language:
URL:
https://aclanthology.org/2020.nlptea-1.14
DOI:
Bibkey:
Cite (ACL):
Hongying Zan, Yangchao Han, Haotian Huang, Yingjie Yan, Yuke Wang, and Yingjie Han. 2020. Chinese Grammatical Errors Diagnosis System Based on BERT at NLPTEA-2020 CGED Shared Task. In Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications, pages 102–107, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Chinese Grammatical Errors Diagnosis System Based on BERT at NLPTEA-2020 CGED Shared Task (Zan et al., NLP-TEA 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.nlptea-1.14.pdf