mutation model 0.002108915
generative model 0.0021031419999999997
specific model 0.002100528
standard model 0.002094284
present model 0.002078848
attachment model 0.002053732
our model 0.0020451569999999997
transformation model 0.0020304489999999997
parametric model 0.002023709
model likeli 0.002021573
tation model 0.0020177529999999997
model venues 0.0020177529999999997
model 0.00180618
training data 0.0015697150000000002
different name 0.001550697
other models 0.00155058
such models 0.001541682
test data 0.001404257
name string 0.00136784
same probability 0.001321306
name tokens 0.001315961
test name 0.0013127339999999999
other tokens 0.001277663
name type 0.001264281
training time 0.001261355
other names 0.00123852
same entity 0.00120901
training set 0.001203284
common name 0.0011847960000000001
entity tokens 0.0011835840000000001
tree method 0.001183522
wikipedia data 0.001181834
test entity 0.001180357
conditional probability 0.001174418
previous name 0.0011603619999999999
different probabilities 0.001145764
entity names 0.001144441
true language 0.001138981
name pairs 0.001128091
different phylogeny 0.00112616
same time 0.00112455
same tokens 0.001121848
different entities 0.001119202
supervised data 0.0011177280000000001
new name 0.00111739
stochastic models 0.001114599
name phylogeny 0.001110963
different order 0.001098537
name strings 0.001096803
string type 0.0010966209999999999
unsupervised data 0.001096382
probability distributions 0.001092233
wikipedia name 0.001090311
data preparation 0.001089686
edit type 0.001089352
person name 0.001087818
parent language 0.001087539
other way 0.001084237
name types 0.001080544
transduction models 0.001075573
second probability 0.001074705
data quality 0.001072246
different corpora 0.0010702189999999999
transformation models 0.001045397
other types 0.001042246
different operations 0.001040713
test set 0.001037826
name variation 0.001035078
name components 0.001034054
different characters 0.001032759
conditional distribution 0.0010307369999999999
such relationships 0.001026968
ing algorithm 0.00102549
output string 0.001025139
word type 0.001023838
language tags 0.001021124
posterior probability 0.001019336
high probability 0.00101557
different parents 0.001013023
positive probability 0.001012826
low probability 0.001010449
untagged name 0.001007822
frequent name 0.001006651
marginal probability 0.001005785
name matching 0.001005134
true entity 0.0010045
different people 0.001004163
total probability 0.001003826
different variants 0.001003533
name aliases 0.001002639
input string 0.001001243
training collection 9.995240000000001E-4
initial name 9.98638E-4
different amounts 9.98234E-4
different initializations 9.98163E-4
random edit 9.97546E-4
training conditions 9.967180000000002E-4
different occa 9.95617E-4
novel name 9.947369999999999E-4
name vari 9.94276E-4
