training data 0.00434627
text data 0.00432291
test data 0.003980602
normal data 0.003970569
simple data 0.003922913
data set 0.003919654
simplification data 0.0038982979999999997
english data 0.0038598699999999996
additional data 0.003837733
language model 0.00382594
perplexity data 0.003813373
data source 0.0036744459999999996
data sources 0.003631775
data sets 0.00362723
general data 0.003626851
unsimplified data 0.003595961
data sizes 0.003572484
data increases 0.003569644
combined data 0.003564904
data estimation 0.003562325
mal data 0.003561233
compression data 0.003559543
real data 0.003557822
data impacts 0.003555492
data scarcity 0.003552507
uncompressed data 0.003552507
data sparsity 0.003552507
normal model 0.002824719
model adaptation 0.002813546
simple model 0.0027770629999999998
model domain 0.002710106
model performance 0.002678425
model perplexity 0.0026675229999999998
language models 0.0026340310000000002
model evaluation 0.002545637
order model 0.002446638
model estimates 0.002435546
guage model 0.002430565
combined model 0.002419054
mal model 0.002415383
model com 0.002411537
lated model 0.002408899
model perplexities 0.002408505
polated model 0.002407994
model esti 0.002407633
model perplexi 0.00240676
simple language 0.0023034229999999998
english language 0.00224038
model 0.00214979
single language 0.002087242
language modeling 0.002033864
other text 0.002017896
language use 0.002006201
output language 0.001989722
language mod 0.001981206
trigram language 0.001939184
sri language 0.0019338699999999999
varied language 0.0019332899999999998
training corpus 0.0017422560000000002
normal training 0.0017255590000000002
normal text 0.0017021990000000002
language 0.00167615
normal models 0.0016328100000000002
text simplification 0.001629928
ing text 0.001616629
other simplification 0.001593284
english text 0.0015915
word level 0.001556821
other domain 0.001550942
word distributions 0.001494565
word occurrence 0.00148648
translation performance 0.0014372019999999998
many text 0.0014294490000000002
other vocabulary 0.001385029
normal corpus 0.001366555
text sources 0.001363405
text sequence 0.001358884
general text 0.001358481
vocabulary models 0.001352284
machine translation 0.001351236
text sim 0.001348484
translation task 0.001346643
recent text 0.001322049
simple corpus 0.0013188990000000001
test set 0.0013089759999999999
text simplifi 0.001293179
text compression 0.001291173
other systems 0.001290721
ple text 0.001286936
text simplicity 0.0012851260000000002
text varia 0.0012851260000000002
unsummarized text 0.0012851260000000002
monolingual translation 0.0012792839999999999
translation tasks 0.00127703
only models 0.001275326
other lan 0.001274574
other factors 0.0012571180000000002
other simplifi 0.001256535
other monolin 0.001252777
other dimension 0.001252658
