topic model 0.0054726
topic models 0.004810849
topic modeling 0.004736112
modeling topic 0.004736112
specific topic 0.004597905
topic distributions 0.004585004
topic classification 0.004579975
topic vector 0.00452024
single topic 0.004447971
topic distribu 0.004435658
gate topic 0.004426852
binary topic 0.004415153
several topic 0.00441317
vectors topic 0.004399358
topic vectors 0.004399358
topic distri 0.004395293
corresponding topic 0.004393565
topic classifier 0.004391719
representative topic 0.00438493
aggregate topic 0.004383214
topic vec 0.004382787
discriminative topic 0.004379783
topic mod 0.004379233
topic clas 0.00437238
stanford topic 0.004369172
fines topic 0.004369172
topic 0.00416064
text data 0.001980357
binary topics 0.001697193
model output 0.00157626
medical data 0.00157062
model representation 0.00156976
ing data 0.001551881
erative model 0.001520549
patient data 0.001497902
top words 0.001473633
data mining 0.001468326
data abstractor 0.001459139
trained data 0.001459139
topics 0.00144268
dirichlet distribution 0.001435379
different training 0.001420907
stop words 0.001318063
frequent words 0.001316036
model 0.00131196
quent words 0.001311002
infrequent words 0.001309943
text classification 0.001150072
semantic analysis 0.001136773
different number 0.001104162
words 0.00110056
training dataset 0.001092508
semantic structure 0.001037016
text collection 0.001032765
modeling lda 0.001025986
training labels 0.001013254
different proportions 9.84924E-4
particular document 9.84143E-4
regular text 9.83423E-4
document collection 9.81972E-4
free text 9.78887E-4
raw text 9.5382E-4
text classifi 9.41103E-4
put text 9.41103E-4
latent dirichlet 9.28733E-4
clinical information 9.15487E-4
test set 9.11361E-4
distribution 9.00454E-4
classification algorithm 8.86204E-4
modeling techniques 8.82551E-4
class distributions 8.63177E-4
other parts 8.62578E-4
automated analysis 8.62265E-4
classification approach 8.4208E-4
same meaning 8.11587E-4
results classification 8.09742E-4
classification results 8.09742E-4
dataset results 8.02857E-4
modeling toolbox 7.85312E-4
sampling approach 7.64154E-4
sampling methods 7.61532E-4
good representation 7.54065E-4
dirichlet alloca 7.46705E-4
original dataset 7.40038E-4
mantic analysis 7.34771E-4
good outcome 7.31214E-4
classification performance 7.29193E-4
classification techniques 7.26414E-4
set proportions 7.26375E-4
testing set 7.15971E-4
generative system 7.09772E-4
positive class 7.07571E-4
test datasets 7.02792E-4
automated systems 7.00839E-4
gibbs sampling 6.97274E-4
classification technique 6.93136E-4
class proportions 6.82888E-4
real dataset 6.81146E-4
erage function 6.80987E-4
training 6.80058E-4
