document model 0.002374632
same document 0.002200331
document length 0.0021581639999999997
class document 0.002021184
suspicious document 0.001991674
document fragments 0.0019493919999999999
whole document 0.001935089
source document 0.001919807
document fragment 0.001916313
document collections 0.0019016530000000001
cious document 0.00184794
other methods 0.001702802
detection method 0.0016420389999999999
class method 0.001606744
document 0.00157971
supervised method 0.0015120429999999998
new features 0.0014695300000000001
method oberreuter 0.001422612
author text 0.001412931
classification model 0.0014092039999999998
new text 0.001375074
authorship attribution 0.001368663
suspicious documents 0.001362804
different techniques 0.001348077
different authors 0.001325914
plagiarism detection 0.001313099
test corpus 0.00130338
tion methods 0.001291727
same corpus 0.001267209
authorship clustering 0.001251303
different options 0.001243945
different combi 0.001243945
arabic documents 0.001224044
cious documents 0.00121907
feature value 0.001217621
stylistic features 0.001214896
literature methods 0.001213003
spanish documents 0.0012078990000000001
intrinsic plagiarism 0.0011780620000000001
plagiarism source 0.001176427
other segmentation 0.0011721029999999999
method 0.00116527
small number 0.001138229
plagiarism direction 0.0011112510000000002
sentence length 0.001111202
text fragments 0.001110423
plagiarism prediction 0.001104881
training corpus 0.0011031980000000001
evaluation corpus 0.001101764
plagiarism detec 0.001100354
sic plagiarism 0.001094092
original text 0.001090674
text representation 0.0010808279999999998
authorship identification 0.0010778609999999998
text fragment 0.001077344
frequency class 0.001068086
paragraph authorship 0.001062179
proportion model 0.00105714
intrinsic approach 0.001056702
corpus docu 0.0010543710000000001
sification model 0.001052813
other hand 0.001051582
authorship clus 0.0010473779999999999
evaluation results 0.001044466
authorship verifica 0.001042568
similar results 0.001041938
total number 0.001037155
several classification 0.001015427
research problem 0.001009624
text repre 0.00100351
first value 9.91926E-4
many nlp 9.89297E-4
style analysis 9.861969999999999E-4
many problems 9.82566E-4
such applications 9.824249999999999E-4
frequency level 9.73853E-4
supervised classification 9.610549999999999E-4
documents 9.5084E-4
chosen number 9.455679999999999E-4
potential author 9.44369E-4
frequency levels 9.26002E-4
average sentence 9.17122E-4
methods 9.1635E-4
inara corpus 9.161200000000001E-4
standardized corpus 9.03908E-4
intermediate frequency 9.020300000000001E-4
classification algo 8.974149999999999E-4
second value 8.925739999999999E-4
novel language 8.901340000000001E-4
future work 8.797950000000001E-4
evaluation measures 8.63025E-4
comparable results 8.58386E-4
main difficulties 8.56106E-4
several papers 8.50054E-4
bayes algorithm 8.478369999999999E-4
standard corpora 8.43112E-4
plagiarism 8.3633E-4
grams length 8.36264E-4
features 8.35197E-4
important component 8.26998E-4
