A Hybrid System for NLPTEA-2020 CGED Shared Task

Meiyuan Fang, Kai Fu, Jiping Wang, Yang Liu, Jin Huang, Yitao Duan


Abstract
This paper introduces our system for the NLPTEA-2020 shared task on Chinese Grammatical Error Diagnosis (CGED), which detects, locates, identifies, and corrects grammatical errors in Chinese writing. The system consists of three components: GED, GEC, and post-processing. GED is an ensemble of multiple BERT-based sequence labeling models that handles the GED tasks. GEC performs error correction: we exploit a collection of heterogeneous models, including Seq2Seq, GECToR, and a candidate generation module, to obtain correction candidates. Finally, in the post-processing stage, results from GED and GEC are fused to form the final outputs. We tune our models to favor precision, which we believe is more crucial in practice. As a result, among the six tracks in the shared task, our system performs well in the correction tracks: measured by F1 score, we rank first in the TOP3 correction track and third in the TOP1 correction track, with the highest precision in both. We rank among the top 4 to 6 in the other tracks, except for FPR, where we rank 12th. Our system also achieves the highest precision among the top 10 submissions in the IDENTIFICATION and POSITION tracks.
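The abstract does not say how the sequence labeling models are combined, so the following is only a minimal sketch of one common approach, per-token voting with an agreement threshold. The function name `ensemble_tags`, the `min_votes` parameter, and the simplified tag set ("O" for no error, plus single-letter error labels) are assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter
from typing import List


def ensemble_tags(predictions: List[List[str]], min_votes: int) -> List[str]:
    """Fuse per-token tag sequences from several models by voting.

    Hypothetical helper: an error tag is kept only when at least
    `min_votes` models agree on it; otherwise the token is labeled "O"
    (no error). Raising `min_votes` trades recall for precision.
    """
    if not predictions:
        return []
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all models must label the same tokens")

    fused = []
    for token_tags in zip(*predictions):
        # Count only error tags; "O" votes never win on their own.
        votes = Counter(t for t in token_tags if t != "O")
        if votes:
            tag, count = votes.most_common(1)[0]
            fused.append(tag if count >= min_votes else "O")
        else:
            fused.append("O")
    return fused


# Three hypothetical models labeling the same four tokens.
preds = [
    ["O", "S", "O", "R"],
    ["O", "S", "O", "O"],
    ["O", "S", "M", "O"],
]
strict = ensemble_tags(preds, min_votes=2)
lenient = ensemble_tags(preds, min_votes=1)
```

With `min_votes=2`, only the tag that a majority of the three models agree on survives; with `min_votes=1`, every error tag proposed by any model is kept. A higher threshold is one plausible way to lean a system toward precision, in the spirit of the tuning described above.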
Anthology ID:
2020.nlptea-1.9
Volume:
Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications
Month:
December
Year:
2020
Address:
Suzhou, China
Editors:
Erhong YANG, Endong XUN, Baolin ZHANG, Gaoqi RAO
Venue:
NLP-TEA
Publisher:
Association for Computational Linguistics
Pages:
67–77
URL:
https://aclanthology.org/2020.nlptea-1.9
DOI:
10.18653/v1/2020.nlptea-1.9
Cite (ACL):
Meiyuan Fang, Kai Fu, Jiping Wang, Yang Liu, Jin Huang, and Yitao Duan. 2020. A Hybrid System for NLPTEA-2020 CGED Shared Task. In Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications, pages 67–77, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
A Hybrid System for NLPTEA-2020 CGED Shared Task (Fang et al., NLP-TEA 2020)
PDF:
https://preview.aclanthology.org/landing_page/2020.nlptea-1.9.pdf