large word 0.002759516
word alignment 0.002556799
computing word 0.002455442
word pair 0.0023752409999999997
complete word 0.002345747
times word 0.0023197969999999997
building word 0.00227688
word align 0.002273162
training data 0.001887938
large cluster 0.001838586
cluster size 0.001684264
data algorithms 0.0016066680000000001
machine cluster 0.001588944
practical cluster 0.001569451
time data 0.001558099
data point 0.001551258
language models 0.001540684
unique words 0.0015356880000000001
cluster computing 0.001534512
various cluster 0.001497874
hadoop cluster 0.0014948379999999999
data structure 0.001490379
instance cluster 0.0014685459999999998
programming model 0.001456719
sense clustering 0.0014527250000000002
data increases 0.0014518180000000001
necessary data 0.00144181
basic data 0.001441034
entire cluster 0.001439983
cluster sizes 0.0014103689999999999
data transfer 0.001399867
alternative model 0.001395111
data transfers 0.001389033
data blocks 0.001389033
data center 0.001389033
cluster comput 0.001353667
modest cluster 0.0013512939999999998
model estimation 0.001339129
large clusters 0.0013210890000000001
gramming model 0.001308929
corpus size 0.001296647
words 0.00126525
large number 0.001251724
learning algorithm 0.0012439460000000001
language processing 0.001207107
natural language 0.001180359
large corpora 0.001149039
cluster 0.00113628
scalable language 0.001122455
context vectors 0.001120714
building language 0.001113818
training set 0.001099144
model 0.00108683
unsupervised sense 0.001085716
large amount 0.001079732
different nodes 0.001077925
corpus con 0.001068856
different widow 0.001052821
different percentages 0.001049905
machine translation 0.001038677
complete corpus 0.0010372
gigaword corpus 0.0010168360000000001
programming models 0.001016425
corpus linguistics 0.001015015
stripes algorithm 9.90149E-4
many information 9.86151E-4
hadoop clusters 9.77341E-4
same key 9.74965E-4
large datasets 9.659790000000001E-4
gaword corpus 9.63254E-4
same experiment 9.57927E-4
large class 9.52789E-4
same cell 9.49999E-4
computer clusters 9.458240000000001E-4
machine learning 9.39491E-4
large num 9.3575E-4
large numbers 9.348199999999999E-4
large sorting 9.279269999999999E-4
large quantities 9.16907E-4
large body 9.16907E-4
first point 9.07889E-4
learning task 8.976590000000001E-4
intermediate results 8.96785E-4
language 8.94148E-4
window size 8.922369999999999E-4
pairs approach 8.86036E-4
feature selection 8.83226E-4
relative size 8.807749999999999E-4
input input 8.75782E-4
english corpora 8.735150000000001E-4
commodity clusters 8.564800000000001E-4
information processing 8.534150000000001E-4
such ser 8.5304E-4
such wastage 8.4936E-4
such capabilities 8.4936E-4
experimental results 8.49089E-4
value pairs 8.47074E-4
mapreduce algorithms 8.449250000000001E-4
parallel algorithms 8.433830000000001E-4
matrix costs 8.39921E-4
