data problem 0.0015985080000000002
many information 0.001597177
same feature 0.001538808
common information 0.001522201
similar pairs 0.001493328
semantic features 0.001481773
information retrieval 0.00146835
sparse data 0.001462254
sense information 0.0014418880000000001
linguistic information 0.001433223
standard information 0.001431681
new features 0.001413126
information theory 0.001390698
available information 0.001385422
ion corpus 0.001373376
information access 0.001368309
similar text 0.001361896
different decision 0.001352496
same topic 0.001340433
potential features 0.00130329
same time 0.001300668
same class 0.001288415
word noun 0.001273383
feature values 0.001255273
text similarity 0.001254229
additional features 0.0012355830000000002
other methods 0.00122318
primitive features 0.001221852
pilot corpus 0.001220138
entire corpus 0.001217828
different points 0.001205311
tdt corpus 0.001205168
training set 0.001205112
annotated corpus 0.00119901
standardized corpus 0.001189411
different types 0.001180568
linguistic features 0.001173246
new method 0.0011708299999999999
different sim 0.00116299
different radeoffs 0.001158596
different topics 0.001158596
different cutoffs 0.001158596
composite features 0.001153421
tive features 0.001126497
similarity similarity 0.001123474
learning method 0.001119327
information 0.00111578
same description 0.001114683
text units 0.0011100189999999998
dissimilar pairs 0.001097093
summarization system 0.001096842
same head 0.001094998
feature vector 0.001091714
graph pairs 0.001088852
posite features 0.001087955
ite features 0.001087955
thesaural features 0.001087955
small text 0.00108133
same focus 0.0010739880000000001
primitive feature 0.00107197
marked pairs 0.001071774
same event 0.0010698370000000001
same action 0.001067498
paragraph pairs 0.001066347
same synset 0.001064997
feature value 0.001051817
word occurrences 0.0010493590000000001
semantic distance 0.001045147
text length 0.001035592
single words 0.0010301
other hand 0.001022672
other types 0.001011756
other primitives 0.001010959
independent feature 0.001008302
present results 0.00100729
word primitives 0.001007249
text clustering 0.001006334
composite feature 0.001003539
text unit 9.978629999999999E-4
experimental results 9.93757E-4
feature names 9.89165E-4
text analysis 9.88102E-4
gle word 9.8594E-4
text matching 9.81566E-4
empirical results 9.64761E-4
feature name 9.64171E-4
feature verb 9.63978E-4
training orientation 9.63901E-4
similar ity 9.633039999999999E-4
relative order 9.594289999999999E-4
current results 9.58263E-4
corpus 9.57427E-4
similar verbs 9.57229E-4
order distance 9.57158E-4
routine training 9.5158E-4
document similarity 9.501869999999999E-4
tool set 9.4918E-4
last method 9.481660000000001E-4
training orien 9.46861E-4
shared words 9.46428E-4
