word context 0.003956331
other word 0.003755483
results word 0.003682915
new word 0.003660375
word stem 0.0035977360000000002
word types 0.003524994
word clustering 0.0035187540000000002
word sense 0.003491136
related word 0.0034674140000000003
word forms 0.003463206
word clusters 0.003454406
short word 0.003454008
supervised word 0.003453542
word pairs 0.003451905
word form 0.003434279
word senses 0.00339121
frequent word 0.003381504
rare word 0.003350206
unrelated word 0.003344474
content word 0.003342454
model cluster 0.002291674
such words 0.002133452
english words 0.002103561
bayesian model 0.002071993
model output 0.002029831
common words 0.001990853
clustering words 0.001963334
pipeline model 0.001952283
ing words 0.001940606
count words 0.001917252
related words 0.001911994
short words 0.001898588
segment words 0.001880901
listing words 0.0018086410000000002
ambiguous words 0.0017917410000000001
unrelated words 0.001789054
model 0.00171299
words 0.00156185
different candidate 0.001485656
large data 0.0014544050000000002
data set 0.001432026
other models 0.001387437
training corpus 0.0013428749999999999
large corpus 0.001327377
real data 0.00129377
mayan language 0.001280847
similar data 0.001277154
small data 0.001252597
data sets 0.001249571
data structure 0.001245931
other candidate 0.0012409259999999998
language uspanteko 0.001232641
language documentation 0.001230062
semantic context 0.001225863
global corpus 0.001215541
same cluster 0.0012125489999999998
data sparseness 0.001190038
sophisticated data 0.001182234
data structures 0.001181398
data sizes 0.001178091
other affix 0.001156904
other information 0.0011493570000000002
context vector 0.0011464119999999999
corpus counts 0.001137489
affix candidate 0.0011214039999999999
different stages 0.0011196259999999999
different means 0.001115521
bayesian models 0.001108227
affix cluster 0.001097375
segmentation method 0.0010876990000000001
multiple cluster 0.001087316
stem candidate 0.0010831789999999999
many parameters 0.0010799540000000002
unsupervised morphology 0.001077214
annotated corpus 0.001068453
entire corpus 0.001060081
gigaword corpus 0.001048405
morphology acquisition 0.00104763
simple approach 0.001045583
other work 0.001041807
possible stem 0.001028102
standard values 0.0010236709999999999
standard clusters 0.001022324
inflectional morphology 0.001014665
productive morphology 0.001004139
language 0.00100255
global candidate 9.96498E-4
other candidates 9.92266E-4
affix pair 9.82582E-4
single stem 9.79022E-4
gold standard 9.775769999999999E-4
derivational morphology 9.77132E-4
many types 9.7327E-4
candidate generation 9.64061E-4
standard thresholds 9.613989999999999E-4
same trie 9.572229999999999E-4
pervised morphology 9.384290000000001E-4
tual morphology 9.35029E-4
semantic similarity 9.34344E-4
tional morphology 9.326E-4
