dialect word 0.00379142
german word 0.003426483
word probability 0.003240491
word form 0.003085743
word maps 0.003036315
dialectal word 0.003017697
word list 0.002996657
word stem 0.002980402
word transformation 0.002963002
word frequency 0.002952222
word forms 0.002951744
word level 0.002948442
word derivations 0.002906674
georeferenced word 0.002867063
erroneous word 0.002864359
minimum word 0.002860773
word lookup 0.002857096
dialect model 0.00270393
language model 0.002506363
dialect words 0.00246805
different words 0.00231336
dialect data 0.00216369
german words 0.002103113
baseline model 0.001930053
german dialect 0.001929903
many words 0.001861323
test data 0.001821436
current model 0.001814113
german data 0.001798753
line model 0.001777391
cation model 0.001771413
dialect text 0.001728251
information system 0.001690199
language models 0.001687009
function words 0.00165245
training data 0.001650361
web dialect 0.001636624
frequency words 0.001628852
dialect spelling 0.001621902
entire words 0.001609161
different approach 0.001599614
different rules 0.0015841
such data 0.001558629
model 0.00155651
rare words 0.001547887
additional dialect 0.00154537
data set 0.001542034
web data 0.001505474
local dialect 0.001502021
dialect identification 0.001476813
spelling system 0.001466742
dialect area 0.001455704
data source 0.001449048
dialect variation 0.001446832
dialect region 0.001434968
first language 0.001430857
different dialects 0.001427151
dialect max 0.00142155
dialect use 0.001417288
dialect regions 0.001402735
wikipedia data 0.001393497
main data 0.001388702
dialect literature 0.001381609
dialect writers 0.001381001
dialect writing 0.001380097
different evaluation 0.001379583
dialect parsing 0.00137939
internal dialect 0.001379095
alemannic dialect 0.00137541
german lexicon 0.001373004
unique dialect 0.001371886
dialect group 0.001368805
dialect landscape 0.001368777
dialect categories 0.001366705
test corpus 0.00136574
homogeneous dialect 0.001365672
data points 0.001365538
dialect identifi 0.001364447
gold dialect 0.001364255
dialect discrimina 0.0013608709999999998
berne dialect 0.001360294
dialect contin 0.001360294
spoken dialect 0.001360294
fribourg dialect 0.001360294
bern dialect 0.001360294
german corpus 0.001343057
test set 0.00133093
lexical rules 0.001328634
words 0.00132063
standard german 0.0013136
phonetic rules 0.001313247
other models 0.001306122
different types 0.001303179
web test 0.00129437
various language 0.001283914
language identification 0.001279246
digital data 0.001263692
different derivations 0.001255404
tic data 0.001254571
different languages 0.001253313
