language model 0.00385535
language first 0.0032854579999999998
same language 0.003151952
texts language 0.003101085
specific language 0.003061409
language identification 0.002976379
single language 0.002968203
statistical language 0.0029553319999999998
language sequences 0.002949154
language pair 0.002943439
major language 0.00293747
language border 0.002904589
language pairs 0.002898927
unknown language 0.002898382
language family 0.002889541
language modeling 0.002885519
own language 0.002884612
ent language 0.002861819
cal language 0.002854741
language change 0.002851794
language identifi 0.002850921
language identifica 0.002849397
cipal language 0.002848746
language fam 0.002848746
language 0.00261554
training data 0.0020095729999999997
first data 0.001964618
learning data 0.001948308
data set 0.0019474919999999999
different languages 0.001901058
corpus data 0.001853095
test data 0.001831539
monolingual data 0.0017754379999999998
other languages 0.0017082959999999998
linguistic data 0.001694711
text length 0.001689742
second data 0.001663465
input data 0.0016543999999999999
data sets 0.001601138
sample data 0.001596686
monolingual text 0.001585398
wikipedia data 0.0015670459999999999
multilingual text 0.001543629
sufficient data 0.001531481
model type 0.001517057
text classification 0.001472314
text source 0.001466397
input text 0.00146436
many languages 0.001460845
text segmentation 0.001454578
text segment 0.001443745
target languages 0.001417645
text sample 0.001406646
text portion 0.001404519
text size 0.0013966349999999998
word segmentation 0.001383198
lingual text 0.001383042
correct languages 0.001382865
suitable text 0.0013564949999999999
major languages 0.00135623
text portions 0.001350997
text segments 0.001349765
text chunk 0.0013443169999999998
european languages 0.001344126
text clas 0.001343253
plain text 0.001342519
embedded text 0.001339664
put text 0.001338945
text tiling 0.001338003
text foro 0.001338003
coherent text 0.001338003
word detection 0.001316138
word borders 0.00131529
corresponding languages 0.0013147369999999999
other words 0.001309597
different lan 0.0012899510000000001
different methods 0.00128693
tiple languages 0.001286553
foreign word 0.0012731139999999999
model 0.00123981
problem set 0.001221703
learning corpus 0.0012120030000000001
test set 0.001189631
first term 0.0011381590000000001
large learning 0.001104315
multilingual learning 0.001092577
such texts 0.001076016
learning methods 0.0010737799999999999
common set 0.001068142
following scores 0.001068028
empirical results 0.001057249
monolingual corpus 0.001039133
languages 0.0010343
specific analysis 0.001031049
machine learning 0.0010231279999999999
set con 0.001020842
specific problem 0.0010147799999999998
following terms 0.001013152
supervised learning 0.001010744
learning corpora 9.94058E-4
