translation data 0.00382174
training data 0.00342728
data selection 0.003088011
translation model 0.00299228
translation annotation 0.0029704600000000003
data figure 0.002696132
new data 0.002658018
ing data 0.002638962
unlabeled data 0.002638857
additional data 0.002579699
labeled data 0.002569743
ldc data 0.002563761
seed data 0.0025210420000000002
linguistic data 0.002453715
sentence translation 0.002445804
crawl data 0.002441794
data consortium 0.002434377
phrase translation 0.002431055
translation time 0.002300581
translation models 0.002217749
annotation cost 0.002196454
translation quality 0.002178853
new translation 0.0021469180000000003
human translation 0.002108006
other annotation 0.002045022
machine translation 0.002044509
translation pairs 0.002034362
low translation 0.002017177
translation speeds 0.001997105
training system 0.001993144
translation pair 0.001963065
annotation time 0.001960401
augmented translation 0.001954621
translation speed 0.0019537590000000002
translation effort 0.0019501190000000002
tence translation 0.001945691
different sentences 0.0019332540000000001
jhiero translation 0.00192377
single word 0.00188604
word alignment 0.001821913
words annotated 0.001815652
human annotation 0.0017678260000000001
annotation costs 0.001757309
new training 0.001752458
random selection 0.001749212
selection algorithm 0.001735329
foreign word 0.00172358
urdu word 0.001717248
sentence selection 0.001712075
word aligner 0.001707764
available training 0.0016840709999999999
annotation times 0.001682932
selection method 0.001673436
word alignments 0.001671433
labeled training 0.0016641829999999999
translation 0.00165532
ent annotation 0.001652799
other selection 0.001651473
different methods 0.0016366610000000002
english language 0.0016105310000000001
annotation effort 0.001609939
sus annotation 0.001584229
equal annotation 0.001584229
selection methods 0.001525322
test set 0.0015185939999999998
foreign words 0.0014919599999999999
urdu words 0.001485628
trigger words 0.001483985
unseen words 0.001482951
selection figure 0.0014513030000000001
unknown words 0.0014494299999999998
lated words 0.001435099
selection strategy 0.001402634
sentence translations 0.0013374760000000002
model 0.00133696
phrase translations 0.001322727
annotation 0.00131514
different conclusions 0.0013116640000000001
different thing 0.0013046660000000001
training 0.00126086
full corpus 0.001260751
tion cost 0.001258782
measure cost 0.001255539
ldc corpus 0.001250252
longest selection 0.001221297
tence selection 0.001211962
accurate cost 0.001211361
selection jsyntax 0.00120624
large set 0.001204826
selection jhier 0.00120253
dom selection 0.0012018200000000001
full sentence 0.0011983240000000002
tion algorithm 0.001191206
inferior selection 0.001190264
total cost 0.0011890450000000001
additional phrase 0.001189014
entire sentences 0.001180547
cost metrics 0.001176257
grams sentences 0.001173884
complete sentences 0.0011726739999999999
