training set 0.001710829
training data 0.001673098
data set 0.0015177469999999998
large training 0.001499341
such features 0.00146934
training time 0.0014403950000000001
learning algorithm 0.001402281
regex learning 0.001393756
other regex 0.001333245
small training 0.001314175
limited training 0.001212537
average training 0.001203278
feature extractor 0.0011823509999999999
training dataset 0.00117805
learning problem 0.00116839
ing set 0.001163496
training testing 0.001159414
testing training 0.001159414
number extraction 0.001157781
phonenumbertask training 0.0011558150000000001
ing algorithm 0.001149476
extra feature 0.001147214
new regex 0.001145268
powerful feature 0.001144985
crf algorithm 0.001143107
training phase 0.001141844
algorithm figure 0.001124713
test set 0.00111934
regex transformation 0.001106925
candidate regex 0.001097263
original regex 0.001095505
input regex 0.001094785
final regex 0.001081263
output regex 0.001076478
information extraction 0.00106813
complex features 0.00105051
data sets 0.001049014
level features 0.001046486
labeled data 0.001044805
regex transformations 0.001040507
learning algorithms 0.0010378689999999999
novel regex 0.00103769
learning task 0.001034258
large number 0.00102842
useful features 0.001025347
extraction tasks 0.0010251729999999999
regex result 0.0010243510000000002
valid set 0.001022936
entity extraction 0.001013927
such entities 0.001008878
current regex 0.001008613
validation set 0.00100715
base features 0.001006266
appropriate features 0.001003884
quality regex 0.001001935
infinite set 9.9909E-4
relie algorithm 9.98608E-4
initial regex 9.932040000000001E-4
regex extractors 9.91077E-4
target regex 9.88687E-4
such extractions 9.85683E-4
mation set 9.83056E-4
incorporated features 9.80903E-4
structural features 9.80903E-4
regex rnew 9.79707E-4
such restrictions 9.79571E-4
such complex 9.709E-4
tial regex 9.69576E-4
validation data 9.69419E-4
objective function 9.69349E-4
algorithm ttotal 9.692400000000001E-4
learning approaches 9.676509999999999E-4
regex languages 9.65634E-4
machine learning 9.649979999999999E-4
modern regex 9.644670000000001E-4
regex engine 9.62934E-4
regex raρ 9.6246E-4
powerful regex 9.621390000000001E-4
improved regex 9.61658E-4
regex engines 9.61658E-4
compact regex 9.60895E-4
java regex 9.602530000000001E-4
regex pat 9.602530000000001E-4
instantiating regex 9.602530000000001E-4
setup data 9.57078E-4
data increases 9.46772E-4
extraction quality 9.423529999999999E-4
large class 9.385630000000001E-4
feature 9.3804E-4
training 9.3309E-4
name extraction 9.195099999999999E-4
extraction qualitya 9.17817E-4
automatic learning 9.14046E-4
raw extraction 9.0555E-4
other machine 9.04487E-4
robust extraction 9.03119E-4
dresses extraction 9.00882E-4
such thatra 9.00753E-4
maximum number 8.890009999999999E-4
same task 8.888940000000001E-4
