language model 0.00255664
word features 0.00249865
word words 0.00245061
words word 0.00245061
classification features 0.00206335
grammar feature 0.002032708
language models 0.002024264
grammar features 0.001967458
feature set 0.001932547
language english 0.001897399
language modeling 0.001875786
native language 0.001866555
other words 0.001792688
function word 0.001782033
syntactic language 0.001777444
feature selection 0.00176127
features accuracy 0.001734376
second language 0.001724199
natural language 0.001713544
word collocations 0.001709387
lexical features 0.001691132
feature values 0.001689255
function words 0.001677643
language identification 0.001669343
language family 0.001653676
feature vector 0.001652279
tive language 0.001651858
language acquisition 0.001647251
feature space 0.001637999
language strings 0.001635557
language writing 0.001617505
language iden 0.001617505
content word 0.001605845
syntactic features 0.001592734
feature sets 0.001563471
tion word 0.001554524
individual feature 0.001551366
feature sparsity 0.001518222
useful features 0.001517051
combined feature 0.001511633
feature dimensionality 0.001508162
word choices 0.001505181
feature dimension 0.001503153
typical word 0.001502803
content words 0.001501455
feature vectors 0.001500638
feature schemata 0.001498523
dividual feature 0.001498523
model approaches 0.001480174
classification results 0.001466793
classification models 0.001460604
tion words 0.001450134
igram features 0.001436428
tactic features 0.001435883
cation features 0.001433821
language 0.00140586
different grammar 0.001401721
maxent model 0.001401062
tent words 0.001392921
data set 0.001350735
text classification 0.001335223
different set 0.00130156
feature 0.0012864
features 0.00122115
training data 0.001214289
test data 0.00120965
classification experiments 0.00119564
grammar collocations 0.001178195
words 0.00117311
topic models 0.001170084
adaptor grammar 0.001166363
pos tags 0.001157447
other work 0.001156592
model 0.00115078
classification task 0.001150368
classification setting 0.001146688
classification performance 0.00112374
base grammar 0.001121411
supervised classification 0.001118212
pos bigrams 0.001117357
modeling set 0.001116073
same data 0.001107283
perspective classification 0.001105797
maxent classification 0.001092482
different collocations 0.0010873
first set 0.001086713
nli classification 0.001085073
classification paradigm 0.001073124
general pos 0.001070319
modeling approach 0.001064398
classification perfor 0.001061224
classification accuracies 0.001059181
base distribution 0.001043027
pos colloca 0.001041436
pure pos 0.001036397
grammar induction 0.001034886
mixed pos 0.001033125
pos counterpart 0.001030116
rare pos 0.001029451
topic modeling 0.001021606
