illegal data 0.00167427
advertising data 0.0016351010000000001
false information 0.001568722
product information 0.001516596
classification model 0.001447884
government data 0.0014300180000000001
word vector 0.001415646
detailed information 0.001414078
mented data 0.0014082410000000002
misleading information 0.0013963340000000002
different rejection 0.0013779460000000001
different combinations 0.0013645810000000001
training set 0.0013640929999999998
different languages 0.001319132
word segmentation 0.001315657
different applica 0.0013098110000000001
different rejec 0.0013098110000000001
segmenter word 0.001295981
word segmenter 0.001295981
word list 0.0012795200000000001
problematic words 0.001217073
sentiment words 0.001204384
frequent words 0.001198892
total words 0.001162237
information 0.00114987
model files 0.001144096
common words 0.001135701
training tasks 0.001122486
quent words 0.001114952
illegal sentence 0.0011067949999999998
illegal sentences 0.001103715
classification models 0.001094554
useful results 0.001090365
same way 0.001085573
test set 0.001075701
different 0.00106645
correct sentences 0.0010646
balanced corpus 0.001044799
standard corpus 0.001034295
classification methods 0.001018952
advertising classification 0.001018523
models advertising 0.001013773
illegal class 9.93708E-4
pos tagging 9.92611E-4
correct score 9.799939999999999E-4
illegal advertising 9.76911E-4
sentence recognition 9.70296E-4
mining method 9.540950000000001E-4
other dataset 9.48768E-4
problematic sentences 9.47971E-4
recommendation method 9.4589E-4
legal sentences 9.426160000000001E-4
target class 9.37095E-4
svm models 9.30023E-4
other hand 9.13529E-4
other types 9.099150000000001E-4
document sentence 9.08483E-4
model 8.98232E-4
binary classification 8.93369E-4
total sentences 8.93135E-4
other lan 8.921739999999999E-4
recognition system 8.90881E-4
binary models 8.88619E-4
overall system 8.88425E-4
sentence segmenter 8.84556E-4
sentence boundary 8.80785E-4
sentences identification 8.796050000000001E-4
text field 8.73579E-4
cos models 8.71677E-4
words 8.64777E-4
classification mod 8.62175E-4
large number 8.61785E-4
sentence levels 8.59965E-4
food models 8.59081E-4
future work 8.582990000000001E-4
advertising research 8.57785E-4
illegal legal 8.54981E-4
legal illegal 8.54981E-4
logrf values 8.51018E-4
relative frequency 8.505100000000001E-4
single class 8.456290000000001E-4
many advertisements 8.443019999999999E-4
sentence boundaries 8.41657E-4
cation models 8.41142E-4
online advertising 8.408809999999999E-4
high performance 8.40824E-4
advertising con 8.38179E-4
incoming sentences 8.377500000000001E-4
legal class 8.32609E-4
results 8.3181E-4
statement classification 8.31644E-4
illegal advertisements 8.275419999999999E-4
our classification 8.27463E-4
such statements 8.27238E-4
classification models 8.26486E-4
illegal datasets 8.229839999999999E-4
illegal food 8.222189999999999E-4
recognition module 8.20651E-4
training 8.18036E-4
related work 8.17094E-4
