text documents 0.0026565499999999997
other documents 0.002547948
french documents 0.002188589
english documents 0.002055104
full documents 0.002052212
only documents 0.002027469
czech documents 0.00198088
entire documents 0.001975802
unseen documents 0.001968295
xml documents 0.001929805
splitting documents 0.001929805
new document 0.001912811
document classification 0.001909598
document length 0.001772992
average document 0.001725439
document source 0.00169752
documents 0.001674
binary document 0.00166057
whole document 0.001652679
other words 0.00164623
document catego 0.001645875
other method 0.0015491110000000002
other languages 0.0014009970000000002
document 0.00138762
additional information 0.00134308
text categorization 0.00133971
test set 0.00133936
fragments method 0.0012770289999999998
text summarization 0.001271156
plain text 0.00125182
other algorithms 0.001249025
text categoriza 0.001244271
enough information 0.001241181
different learning 0.0012184589999999999
first method 0.001210901
learning set 0.001202645
model 0.00116592
significant word 0.001156158
other hand 0.001142203
other kinds 0.001130347
topic recognition 0.001125846
main results 0.001104758
support vector 0.001094992
vector machines 0.001094731
different weights 0.0010941009999999999
classification accuracy 0.0010909449999999998
initial fragments 0.001054301
second method 0.001024385
classification fragment 0.001017369
information 9.74886E-4
same label 9.52549E-4
probability ratio 9.5084E-4
same cardinality 9.48853E-4
initial fragment 9.478259999999999E-4
related work 9.43986E-4
method sim 9.40693E-4
first sentences 9.10739E-4
data sets 9.07855E-4
sign test 9.00582E-4
accuracy increase 8.842349999999999E-4
tial fragments 8.64187E-4
fident fragments 8.59717E-4
dent fragments 8.59717E-4
several experiments 8.50953E-4
classification tasks 8.41205E-4
vector 8.22802E-4
topic 8.19496E-4
novel methods 8.19295E-4
bayes classifier 8.07728E-4
french cooking 8.04458E-4
single class 8.03255E-4
first split 7.96953E-4
future research 7.915190000000001E-4
french recipes 7.90043E-4
learning algorithms 7.888039999999999E-4
smo algorithm 7.73752E-4
words 7.72282E-4
average number 7.59181E-4
evaluation criterion 7.57725E-4
important result 7.55254E-4
initial part 7.48883E-4
similar behaviour 7.465600000000001E-4
novel approach 7.44262E-4
experimental settings 7.29625E-4
frequent class 7.26994E-4
average length 7.23191E-4
initial frag 7.17551E-4
next stage 6.97402E-4
related works 6.905489999999999E-4
results 6.87761E-4
minimum length 6.8146E-4
decision tree 6.80406E-4
main characteristics 6.79186E-4
significant increase 6.78117E-4
cant number 6.778859999999999E-4
method 6.75163E-4
relative increase 6.72254E-4
source docu 6.67105E-4
optimal length 6.6101E-4
medical papers 6.600169999999999E-4
