model training 0.0032472229999999996
different language 0.003073009
model size 0.003003067
model probabilities 0.002986736
model scores 0.002975159
figure model 0.002899017
language code 0.002799042
language identification 0.0027951879999999997
trained model 0.002783359
language development 0.0027803859999999997
model sizes 0.0027732499999999997
default model 0.002747663
computing model 0.002746359
native language 0.002673774
language identifiers 0.002663665
language klingon 0.002662543
language identifier 0.002655971
model 0.00249887
language 0.00240715
training data 0.002327213
test data 0.002184144
online data 0.0019037860000000002
data sets 0.0018646490000000001
data streams 0.001830021
data the 0.001827775
the data 0.001827775
probability values 0.001436175
other mapping 0.00139635
training corpus 0.001346467
other code 0.00127155
other lan 0.0012599059999999999
different values 0.001255553
probability distribution 0.001253695
test set 0.001155285
mapping function 0.0011497830000000001
other programs 0.001129468
probability mass 0.001098534
overwhelming probability 0.001096681
search engine 0.001088627
test text 0.001072073
first mapping 0.001065035
hash table 0.00103618
trained models 0.001001875
mnogosearch search 0.001001739
different fingerprint 0.0010015319999999999
test string 9.891029999999999E-4
different writing 9.66087E-4
baseline error 9.512380000000001E-4
different gamma 9.48497E-4
test strings 9.44422E-4
bayes approach 9.28234E-4
score mapping 9.24018E-4
development set 9.23237E-4
hash entry 9.046900000000001E-4
subset corpus 8.97793E-4
numerous languages 8.936700000000001E-4
frequency mapping 8.93511E-4
future work 8.919119999999999E-4
baseline performance 8.900010000000001E-4
langid corpus 8.878649999999999E-4
logarithm function 8.860960000000001E-4
maximum size 8.82055E-4
additional lan 8.721800000000001E-4
pluricentric languages 8.698830000000001E-4
test files 8.68275E-4
europarl corpus 8.574939999999999E-4
tau values 8.567609999999999E-4
unreserved test 8.53194E-4
similar performance 8.513080000000001E-4
trenkle approach 8.49879E-4
optimal values 8.48088E-4
probability 8.46481E-4
classification error 8.45619E-4
fingerprint size 8.398699999999999E-4
ent values 8.39364E-4
mapping functions 8.3792E-4
mentary information 8.27366E-4
hash tables 8.245360000000001E-4
second mapping 8.23051E-4
error rate 8.2189E-4
logarithmic term 8.17814E-4
random gaussian 8.10788E-4
mapping func 8.08877E-4
little work 8.00446E-4
gamma mapping 7.9933E-4
error rates 7.92106E-4
baseline condition 7.91158E-4
related work 7.89805E-4
program error 7.86765E-4
loglike mapping 7.79822E-4
mapping program 7.73186E-4
table entries 7.72525E-4
minor text 7.652150000000001E-4
nonlinear mapping 7.64842E-4
implicit mapping 7.64842E-4
words 7.63736E-4
relative percentage 7.614939999999999E-4
lookup table 7.60609E-4
penalty score 7.59877E-4
absolute performance 7.53356E-4
