word frequency 0.002706431
weasel word 0.0026161839999999997
times word 0.002430738
word fre 0.002420856
training data 0.0021793719999999997
words wikipedia 0.0020790879999999998
test data 0.002075889
weasel words 0.002035474
domain data 0.001968207
specific words 0.001899538
annotated data 0.001876496
development data 0.001699466
testing data 0.001652535
corrupt data 0.00164858
words 0.00160188
different language 0.001569874
weasel tag 0.001511264
same sentence 0.0013599089999999999
ing corpus 0.001313097
training set 0.001279298
different languages 0.0012587240000000001
annotated corpus 0.0012526059999999999
different thresholds 0.001236427
linguistic features 0.0012235570000000001
test set 0.0011758150000000002
weasel tags 0.001109658
corpus statistics 0.001094042
information extraction 0.001064174
unique sentences 0.0010485630000000002
agreement tags 0.001016708
training instances 0.001007694
training seeds 9.96012E-4
weasel context 9.9008E-4
whole sentence 9.73515E-4
objectionable sentence 9.7102E-4
many wikipedia 9.69854E-4
work research 9.51215E-4
previous work 9.456180000000001E-4
high frequency 9.45099E-4
development test 9.42055E-4
natural language 9.415039999999999E-4
syntactic patterns 9.13996E-4
wikipedia weasel 9.10802E-4
balanced test 9.0153E-4
annotation weasel 8.940269999999999E-4
tnt tagger 8.882779999999999E-4
approach cov 8.78786E-4
tic patterns 8.68628E-4
relative frequency 8.66702E-4
many nlp 8.61207E-4
language versions 8.599E-4
first experiments 8.58543E-4
frequency measure 8.50929E-4
related work 8.460049999999999E-4
passive patterns 8.397719999999999E-4
future work 8.36676E-4
linguistic hedges 8.29431E-4
narrow domain 8.220780000000001E-4
language process 8.21855E-4
original annotation 8.20862E-4
linguistic hedging 8.18441E-4
seed set 8.17842E-4
small set 8.116620000000001E-4
information 8.07725E-4
multiple wikipedia 7.98552E-4
corpus 7.9276E-4
sentences 7.77002E-4
frequency measures 7.75148E-4
linguistic means 7.74035E-4
high score 7.72459E-4
wikipedia style 7.71077E-4
many domains 7.68228E-4
nlp research 7.63477E-4
detection system 7.63459E-4
training 7.62722E-4
wikipedia articles 7.61707E-4
balanced set 7.588670000000001E-4
ing proposition 7.5574E-4
total number 7.51254E-4
unannotated weasel 7.429369999999999E-4
sentence 7.38354E-4
weasel contexts 7.30613E-4
entire wikipedia 7.28364E-4
truth value 7.26431E-4
wikipedia edit 7.224320000000001E-4
further hedging 7.20554E-4
avgdist value 7.15331E-4
wikipedia editors 7.149960000000001E-4
manual annotations 7.089430000000001E-4
gold standard 7.084999999999999E-4
automatic detection 7.04971E-4
great number 6.93358E-4
small sample 6.90976E-4
weasel phrases 6.858649999999999E-4
features 6.83873E-4
potential weasel 6.8156E-4
main verbs 6.81216E-4
large margin 6.74548E-4
such constellations 6.71278E-4
cific weasel 6.65308E-4
