training data 0.002174919
annotation data 0.002067482
data set 0.001922797
language data 0.001849886
learning algorithm 0.001783517
ing data 0.001733281
feature weights 0.001725913
feature functions 0.001698606
labeled data 0.001697795
additional data 0.001687686
model performance 0.001648999
corpus entity 0.001648759
common feature 0.001636526
training set 0.001619276
small data 0.001592278
linguistic data 0.001550443
weak model 0.001535797
data quality 0.001520066
structured data 0.001517571
crf model 0.001512491
function algorithm 0.001503426
data points 0.001490632
sequential data 0.001487172
label sequence 0.00148563
deteriorated data 0.001477813
new model 0.001476905
quential data 0.001476693
data pollution 0.001473902
data consortium 0.001473902
overall model 0.001414977
current model 0.001400589
active learning 0.001398587
labeled training 0.001394274
maximum model 0.001391896
different entity 0.001389058
corpus tokens 0.001387097
learning task 0.001370222
learning curves 0.001354937
high training 0.001352672
feature 0.00134238
rent model 0.001339376
respective model 0.001336631
model parameters 0.001331031
sequence examples 0.001321209
labeled corpus 0.001320308
machine learning 0.001319831
initial model 0.001319228
same time 0.001309594
annotated corpus 0.001309486
cost model 0.001307984
poor model 0.001298241
overall training 0.001297956
model expecta 0.001289004
random selection 0.001285433
respective learning 0.001282491
learning scheme 0.001261155
different selection 0.001239655
training utility 0.001238446
passive learning 0.001235123
learning protocol 0.001233856
limited training 0.001221361
training material 0.001211358
training part 0.001209772
many sequence 0.001201936
abstracts corpus 0.001190356
pennbioie corpus 0.001183521
sequence labeling 0.001177822
ing set 0.001177638
training util 0.001175139
entity class 0.001174825
precious training 0.001170447
manual annotation 0.001162633
full annotation 0.001155071
class label 0.001149019
annotation costs 0.001147975
terms annotation 0.001146998
other tokens 0.001114541
possible label 0.001113339
corpus fmax 0.001110978
annotation effort 0.001103735
annotation rate 0.001103209
reference corpus 0.001097449
plete corpus 0.001097449
label sequences 0.001096045
real annotation 0.001093987
example selection 0.001088213
conditional random 0.001084439
annotation cost 0.001083526
entity classes 0.001080907
annotation point 0.001080684
evaluation set 0.00107766
actual annotation 0.001069776
sesal approach 0.001069638
annotation pur 0.001064054
annotation campaigns 0.001064054
entity sub 0.001061039
unlabeled examples 0.001058054
labeled examples 0.001055374
common approach 0.001053602
model 0.00105272
