training corpus 0.001931277
corpus type 0.0017222560000000001
word sequence 0.001656234
rules training 0.001581774
text corpus 0.001545861
word sense 0.001515641
testing corpus 0.001500934
large corpus 0.001478109
preceding word 0.001464428
word clustering 0.001463539
other words 0.0014395150000000002
information extraction 0.0014102239999999999
annotated corpus 0.001392173
name type 0.001389664
parsed corpus 0.0013634150000000002
raw corpus 0.001358068
similar words 0.0013551890000000001
data set 0.0013524589999999999
name list 0.001337029
constructed corpus 0.001321799
markov model 0.001303527
common words 0.001295145
list learning 0.0012765279999999999
large training 0.001272406
sparse data 0.00126778
training instances 0.001252575
hmm training 0.001248519
person names 0.0012287399999999999
corresponding nes 0.001225481
training process 0.001210769
same sentence 0.0012019650000000002
similar context 0.001174099
raw training 0.001152365
title words 0.0011469359999999999
neighboring words 0.001143151
rule learning 0.001138942
loc nes 0.001129936
name instances 0.001125686
organization names 0.001113015
high performance 0.001111973
name seeds 0.001102169
org names 0.001101773
name types 0.0010966980000000001
organization type 0.00109367
learning string 0.001090245
english parser 0.001083483
proper name 0.001074508
token feature 0.001071729
corpus 0.00106849
hmm learning 0.001061129
learning hmm 0.001061129
system approaches 0.00105914
loc names 0.001057245
numerical nes 0.00105298
bootstrapping system 0.001047836
small list 0.001047706
learning systems 0.001045301
name john 0.001045053
name pos 0.001034697
per names 0.001032579
such patterns 0.001031417
same answer 0.0010226760000000001
overall system 0.001020996
such systems 0.001020843
sample rules 0.00101538
proper names 0.001011721
city name 0.001009053
effective learning 0.001004911
information 0.00100389
antecedent nes 9.99936E-4
supervised learning 9.98182E-4
machine learning 9.96815E-4
name rochester 9.90055E-4
system design 9.86774E-4
context evidence 9.838E-4
tagger type 9.835269999999999E-4
handcrafted rules 9.77011E-4
type precision 9.748319999999999E-4
different techniques 9.73862E-4
homogeneous rules 9.735270000000001E-4
infoxtract system 9.726819999999999E-4
successive learning 9.7053E-4
performance degradation 9.66546E-4
system architecture 9.65837E-4
tag sequence 9.657439999999999E-4
respiratory system 9.63459E-4
model 9.62718E-4
neighboring context 9.620610000000001E-4
iterative learning 9.617510000000001E-4
different views 9.56859E-4
performance enhancement 9.42556E-4
company names 9.40566E-4
same fact 9.31161E-4
same rationale 9.31161E-4
unsupervised learning 9.30541E-4
medicine names 9.269829999999999E-4
common noun 9.25327E-4
rule accuracy 9.229220000000001E-4
language processing 9.172E-4
natural language 9.1508E-4
