record information 0.001958866
record extraction 0.001925506
different record 0.0018272890000000002
compatibility features 0.001653066
clustering fields 0.001638376
same record 0.0016373070000000002
single record 0.001634667
record example 0.001599039
record domain 0.0015829160000000002
learning field 0.001581842
field compatibility 0.001573246
different records 0.001550442
same features 0.0015480469999999999
record dataset 0.001541802
new record 0.0015415210000000001
possible record 0.001512153
record segmentation 0.001496848
canonical record 0.001492257
state field 0.001489595
contact record 0.0014826420000000002
record boundaries 0.001482588
true record 0.001477163
text domain 0.001463416
tact record 0.001454851
separate record 0.001445487
partial record 0.0014433900000000001
record extrac 0.00144318
record increases 0.0014412770000000001
field values 0.001440951
record rij 0.001435986
tect record 0.001435986
text analysis 0.001432908
information extraction 0.001419292
weak features 0.001412874
other fields 0.0014022610000000001
space model 0.001395098
multiple field 0.001384197
count features 0.001368535
tial features 0.001364767
ful features 0.00134867
unstructured text 0.001347198
certain field 0.001341561
city field 0.001333355
free text 0.001325397
field sets 0.001321104
number fields 0.00131992
training data 0.001317628
database records 0.001307907
clustering algorithm 0.0013058829999999999
regression model 0.001304331
state fields 0.001303923
field pleasantville 0.001284965
relation extraction 0.001280921
multiple records 0.0012764299999999998
company field 0.001272532
small records 0.001271669
field univ 0.001267669
feature function 0.001253416
probabilistic model 0.001248658
record 0.00123254
model parame 0.0012252539999999998
several fields 0.001223921
canonical records 0.00121541
contact records 0.001205795
many fields 0.001205669
true records 0.0012003159999999999
multiple fields 0.001198525
tact records 0.0011780039999999999
records one 0.001161798
cohesive records 0.001161076
time information 0.001160871
certain fields 0.001155889
features 0.00114328
adjacent fields 0.001137247
compatibility function 0.00111569
partitioning fields 0.001107727
compatible fields 0.001103084
nearby fields 0.001087014
tiple fields 0.001085217
cent fields 0.001085054
clustering methods 0.001084003
extraneous fields 0.001083916
cluster compatibility 0.001083631
parate fields 0.001083115
vert fields 0.001083115
fields anno 0.001083115
labeled data 0.001076748
field 0.00106346
word tokens 0.001056242
extraction systems 0.001052007
compatibility method 0.001040882
syntactic information 0.001040324
clustering process 0.001030209
present clustering 0.001028334
model 0.00102039
extraction task 0.0010052020000000002
erative clustering 9.97523E-4
cluster figure 9.9471E-4
same cluster 9.78612E-4
contact information 9.76428E-4
