different language 0.00240567
same language 0.002083374
language information 0.002057254
training data 0.002004627
data set 0.00190973
test data 0.001891512
language new 0.001891359
other words 0.001798575
words approach 0.001738438
words method 0.001698557
particular language 0.001687425
language query 0.001672146
data size 0.001661471
language identification 0.001661162
language processing 0.001642227
data sets 0.001611834
available data 0.001602193
states data 0.001596048
native language 0.001596014
knowledge language 0.001578654
reference language 0.001577653
natural language 0.001571768
different classification 0.001569702
single words 0.001568501
language anno 0.001558775
final data 0.00155829
reliable data 0.001554325
data generation 0.001551488
default language 0.001548625
language minc 0.001541926
annotated data 0.001538736
different languages 0.001534693
training corpus 0.001532496
textual language 0.00152994
language identifi 0.001526572
morphological model 0.001525818
language identi 0.001523067
language stat 0.001521691
intended language 0.001521691
language intent 0.001521691
language identifica 0.001521691
tified language 0.001521691
inital data 0.001513689
data gener 0.001513689
common words 0.001471478
multiple words 0.001414086
same test 0.001403256
short words 0.001364208
different clas 0.001352283
words technique 0.001347685
composite words 0.001347502
statistical model 0.001344571
language 0.00128973
frequency features 0.001272197
information feature 0.001269876
test set 0.001237442
word counts 0.001220373
new test 0.001211241
tion model 0.001195823
new method 0.001195196
unigram model 0.001181902
affix information 0.001178513
high frequency 0.001176518
same query 0.00117606
same way 0.001162986
combined model 0.001162816
gram model 0.001152805
training corpora 0.001145817
tical model 0.001139391
morphological feature 0.001120502
other languages 0.001112338
words 0.00110499
new feature 0.001103981
annotation results 0.001092386
other models 0.001091182
other systems 0.001057987
tree classifier 0.001044104
same parameters 0.001043921
additional features 0.001037175
geographical information 0.00103105
same languge 0.001027008
probability error 0.001008705
tree system 0.001002266
total frequency 0.000995598
english queries 0.000994412
letter frequency 0.000993334
successful results 0.000987097
time period 0.000981759
unsupervised system 0.000971706
months time 0.000969706
other lan 0.000961131
frequency threshold 0.000957648
training instances 0.000957288
training phase 0.000955361
linguistic features 0.00095115
other types 0.000950067
other hand 0.000947548
individual features 0.000944437
such cases 0.000943023
novel approach 0.000941187
