language model 0.00464804
language models 0.003751389
language identification 0.003436865
automatic language 0.003429731
majority language 0.003371049
language resources 0.0033457929999999997
implementation language 0.0033378459999999998
language 0.00307261
word level 0.002712164
word classification 0.002548552
word accuracy 0.002500615
only word 0.002444634
current word 0.002430919
individual word 0.002413644
word boundaries 0.00235667
word frequencies 0.002333673
other words 0.002179224
regression model 0.001893013
individual words 0.001806464
multilingual data 0.001755174
dividual words 0.00172692
model 0.00157543
words 0.00146746
text segments 0.001338195
multilingual web 0.001318905
ing character 0.001303443
multilingual documents 0.001296303
same features 0.0012939
same character 0.001261382
multilingual online 0.001238744
segmented text 0.001207302
following features 0.001171598
multilingual communication 0.001170146
multilingual speakers 0.0011382900000000001
feature values 0.0010985069999999999
test set 0.001098463
document level 0.001088821
character sequences 0.001088494
corpus our 0.001078083
level accuracy 0.0010634989999999999
random set 0.001058994
multilingual conversa 0.001046588
authorship corpus 0.001044936
such resources 0.001037502
sentence level 0.001021081
training corpora 0.001019076
probability values 0.0010185770000000001
ditional features 0.0010099
large online 9.95241E-4
ing dictionaries 9.47828E-4
english phrases 9.47577E-4
large segments 9.3139E-4
nority languages 9.275430000000001E-4
document classification 9.25209E-4
conditional random 9.048400000000001E-4
log probability 9.01942E-4
english blogs 8.910820000000001E-4
same post 8.81258E-4
first names 8.77673E-4
classification problem 8.77635E-4
future work 8.54634E-4
ing demand 8.493649999999999E-4
online communication 8.44284E-4
dutch web 8.40602E-4
large scale 8.3302E-4
turkish web 8.30122E-4
following tokens 8.29668E-4
monolingual texts 8.28068E-4
web pages 8.271629999999999E-4
lookup approach 8.1601E-4
post classification 8.08281E-4
learning classifiers 8.043169999999999E-4
sequence labeling 8.01181E-4
dictionary lookup 7.87584E-4
online forum 7.79571E-4
corpus 7.63689E-4
spelling variations 7.63321E-4
machine learning 7.6059E-4
wikipedia pages 7.53374E-4
classification fraction 7.50378E-4
realistic texts 7.488799999999999E-4
short texts 7.48868E-4
features 7.47011E-4
good news 7.46954E-4
related work 7.465390000000001E-4
monolingual speakers 7.423499999999999E-4
manual identification 7.370580000000001E-4
online name 7.34049E-4
various machine 7.32301E-4
following contributions 7.31574E-4
porate context 7.291680000000001E-4
code switching 7.285659999999999E-4
natural texts 7.28493E-4
additional con 7.261069999999999E-4
online communities 7.20158E-4
online environ 7.18738E-4
tilingual online 7.18738E-4
online discussions 7.18738E-4
public dataset 7.17454E-4
character 7.14493E-4
