Abstract
This paper reports our contribution (team WLZ) to the NLI Shared Task 2017 (essay track). We first extract lexical and syntactic features from the essays, perform feature weighting and selection, and train linear support vector machine (SVM) classifiers each on an individual feature type. The output of base classifiers, as probabilities for each class, are then fed into a multilayer perceptron to predict the native language of the author. We also report the performance of each feature type, as well as the best features of a type. Our system achieves an accuracy of 86.55%, which is among the best performing systems of this shared task.- Anthology ID:
- W17-5044
- Volume:
- Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications
- Month:
- September
- Year:
- 2017
- Address:
- Copenhagen, Denmark
- Editors:
- Joel Tetreault, Jill Burstein, Claudia Leacock, Helen Yannakoudakis
- Venue:
- BEA
- SIG:
- SIGEDU
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 390–397
- Language:
- URL:
- https://aclanthology.org/W17-5044
- DOI:
- 10.18653/v1/W17-5044
- Cite (ACL):
- Wen Li and Liang Zou. 2017. Classifier Stacking for Native Language Identification. In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, pages 390–397, Copenhagen, Denmark. Association for Computational Linguistics.
- Cite (Informal):
- Classifier Stacking for Native Language Identification (Li & Zou, BEA 2017)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/W17-5044.pdf