Classifier Stacking for Native Language Identification

Wen Li, Liang Zou


Abstract
This paper reports our contribution (team WLZ) to the NLI Shared Task 2017 (essay track). We first extract lexical and syntactic features from the essays, perform feature weighting and selection, and train linear support vector machine (SVM) classifiers each on an individual feature type. The output of base classifiers, as probabilities for each class, are then fed into a multilayer perceptron to predict the native language of the author. We also report the performance of each feature type, as well as the best features of a type. Our system achieves an accuracy of 86.55%, which is among the best performing systems of this shared task.
Anthology ID:
W17-5044
Volume:
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications
Month:
September
Year:
2017
Address:
Copenhagen, Denmark
Editors:
Joel Tetreault, Jill Burstein, Claudia Leacock, Helen Yannakoudakis
Venue:
BEA
SIG:
SIGEDU
Publisher:
Association for Computational Linguistics
Note:
Pages:
390–397
Language:
URL:
https://aclanthology.org/W17-5044
DOI:
10.18653/v1/W17-5044
Bibkey:
Cite (ACL):
Wen Li and Liang Zou. 2017. Classifier Stacking for Native Language Identification. In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, pages 390–397, Copenhagen, Denmark. Association for Computational Linguistics.
Cite (Informal):
Classifier Stacking for Native Language Identification (Li & Zou, BEA 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-3/W17-5044.pdf