word distribution 0.0028478870000000003
word clustering 0.002590105
word cluster 0.002564545
word tokens 0.002452961
topical word 0.0024020540000000003
word distributions 0.002369345
multinomial word 0.002364316
word counts 0.002293868
word wij 0.002245792
word probabili 0.0022450630000000003
glish word 0.002230156
same topic 0.002100083
topical words 0.002033624
model results 0.0020013130000000002
sorts words 0.001895693
frequent words 0.001885336
background words 0.001884725
ical words 0.001880105
share words 0.001861424
topic detection 0.001814895
model parameters 0.001678074
words 0.00163703
analysis model 0.0016307560000000001
model log 0.001629142
structure model 0.001617733
ltb model 0.001614886
generative model 0.001593751
occc model 0.0015736580000000001
graphical model 0.001544748
brid model 0.001492677
topic 0.00146431
other data 0.001440826
other documents 0.001413943
clustering documents 0.001398869
such documents 0.001341018
document cluster 0.001328059
model 0.00126989
core documents 0.001233681
test documents 0.001230124
topical documents 0.001210818
learning documents 0.001208139
clustering method 0.0011996139999999999
core document 0.001188431
posterior distribution 0.001187454
good results 0.001177178
clustering models 0.001170965
data chunks 0.001094597
joint distribution 0.001094157
background distribution 0.001090122
dataset likelihood 0.001087165
group documents 0.001085398
rior distribution 0.001079238
high probability 0.001077591
data instances 0.001068145
wad documents 0.001066851
clustering methods 0.001061238
high values 0.001057998
long documents 0.001055538
unlabeled documents 0.001052074
text clustering 0.001047622
represent documents 0.001037707
press documents 0.001037707
sort documents 0.001037707
document topicality 0.0010347009999999999
high value 0.001021307
clustering task 0.001012368
document vec 9.93206E-4
parameter values 9.87676E-4
cluster size 9.85989E-4
space models 9.79714E-4
core cluster 9.78542E-4
analysis method 9.75835E-4
excellent results 9.73235E-4
superior results 9.6529E-4
method ltb 9.59965E-4
ltb method 9.59965E-4
indistinguishable results 9.568000000000001E-4
clustering problem 9.34467E-4
ltb models 9.31316E-4
occc algorithm 9.24165E-4
occc method 9.18737E-4
method occc 9.18737E-4
new score 8.98102E-4
other machine 8.914839999999999E-4
other applications 8.84854E-4
same length 8.84117E-4
clustering accuracy 8.817909999999999E-4
new representation 8.809270000000001E-4
simple information 8.77084E-4
method wad 8.67596E-4
theoretic algorithm 8.5838E-4
binary models 8.52542E-4
main parameters 8.47582E-4
core size 8.463609999999999E-4
distribution 8.42427E-4
wad dataset 8.41964E-4
core clusters 8.400269999999999E-4
text classification 8.392359999999999E-4
following result 8.355050000000001E-4
generative process 8.33203E-4
