training data 0.0038755200000000004
word training 0.0034343000000000004
training set 0.002923624
different data 0.0029002200000000002
data test 0.0027946430000000003
training corpus 0.002749036
large training 0.002642719
unlabeled training 0.00262948
size training 0.002581311
training size 0.002581311
small training 0.002540169
training time 0.002511487
new training 0.002496843
labeled training 0.002477242
training instance 0.002452653
training samples 0.00244916
additional training 0.00244573
large data 0.002441279
training instances 0.002438345
single training 0.0024370060000000002
tagger training 0.002430251
unlabeled data 0.0024280400000000002
particular training 0.002418524
annotated training 0.002411074
training corpora 0.002410392
training sets 0.002393469
potential training 0.0023915720000000002
data size 0.002379871
unannotated training 0.002366904
unsupervised training 0.002365653
total training 0.002352326
source training 0.002343239
consecutive training 0.002342385
original training 0.002341672
small data 0.0023387290000000003
training biases 0.002313414
training material 0.0022938150000000003
sparse training 0.002291986
training collections 0.002289746
labeled data 0.002275802
additional data 0.00224429
annotated data 0.0022096340000000003
data sets 0.002192029
data sources 0.002160496
different learning 0.0021491
data seed 0.002129207
training 0.00203848
word sense 0.001952991
test set 0.001842747
learning methods 0.001781412
learning algorithm 0.0017062499999999999
word seed 0.0016879870000000002
target word 0.001674685
test accuracy 0.001630481
standard learning 0.001593041
supervised learning 0.001580493
learning experiments 0.001558551
set accuracy 0.001558022
new learning 0.001544283
machine learning 0.001531251
different machine 0.001508511
different labels 0.001493849
unlabeled set 0.001476144
same test 0.00147212
active learning 0.001461011
test sample 0.0014358980000000001
set size 0.0014279750000000002
unsupervised learning 0.001413093
learning charniak 0.001400492
learning algorithms 0.001391554
test sentence 0.001391158
learning approaches 0.001390681
small set 0.001386833
different sources 0.001386636
learning curve 0.001384725
learning techniques 0.00138327
learning curves 0.001340225
different percentages 0.001317756
recent set 0.001309007
unlabeled corpus 0.0013015560000000002
agreement test 0.001285782
model probabilities 0.001267405
set disambiguation 0.0012548210000000001
corpus size 0.001253387
classification label 0.001237219
classification accuracy 0.001204376
words winnow 0.001188281
such problems 0.001181933
set increases 0.001181256
confusable words 0.00118056
language classification 0.001169705
confusion set 0.001168295
set members 0.0011575980000000001
average performance 0.001150989
human annotation 0.001144928
performance improvement 0.001110838
such cases 0.001098923
general problem 0.001089175
learning 0.00108592
annotated corpus 0.00108315
