translation model 0.00319908
parallel corpus 0.00310766
parallel training 0.0026171199999999997
training data 0.0025980400000000002
translation scores 0.0025130169999999998
machine translation 0.002504358
parallel corpora 0.002457832
translation quality 0.0024263279999999997
translation probabilities 0.0024035759999999997
query translation 0.0023878
translation matrix 0.002363347
translation disambiguation 0.002298924
trained translation 0.0022904479999999996
estimating translation 0.002288307
test data 0.002268575
target language 0.002226284
source language 0.00221568
test corpus 0.002212735
domain corpus 0.002148146
parallel documents 0.002127982
language models 0.0020666
target corpus 0.002047954
evaluation data 0.0020399150000000002
translation 0.00202808
respective language 0.001988678
corpus figure 0.001913442
training corpora 0.0018923719999999998
different training 0.001878031
purpose parallel 0.001866013
custom parallel 0.001852316
springer corpus 0.0018109749999999998
corpus figures 0.001797855
best corpus 0.001784458
language 0.0016947
parallel 0.00159129
training documents 0.0015625219999999998
different test 0.001548566
training set 0.001529206
corpus 0.00151637
available training 0.001489186
ibm model 0.001482043
okapi model 0.00146865
target corpora 0.001398126
training resources 0.001374415
entire training 0.001346369
other words 0.001342388
appropriate training 0.001326241
springer training 0.001320435
document score 0.0012746670000000002
all corpora 0.001255152
domain news 0.0012517700000000001
test documents 0.001233057
similarity score 0.0012320500000000002
test set 0.001199741
simple method 0.001195598
model 0.001171
information retrieval 0.001160288
clir method 0.001159352
document quality 0.001141695
alignment algorithm 0.001136176
corpora best 0.00113463
corpora characteristics 0.001129351
corpora separately 0.00112649
average document 0.0011235300000000002
document selection 0.001099776
system development 0.001090165
document length 0.001076466
test collection 0.001072737
target documents 0.001068276
individual document 0.0010676890000000001
medical domain 0.0010486
standard performance 0.001028492
training 0.00102583
other domains 0.0010134929999999999
true test 0.001013197
other parameter 0.00101122
lemur system 0.001009012
other areas 0.001000236
available documents 0.001000048
springer test 9.909699999999999E-4
similar documents 9.80651E-4
cosine similarity 9.703310000000001E-4
similarity metrics 9.6937E-4
large collection 9.580960000000001E-4
particular domain 9.5352E-4
mutual information 9.5338E-4
domain match 9.41065E-4
several parameters 9.337530000000001E-4
news wac 9.26677E-4
wac news 9.26677E-4
performance measure 9.17508E-4
news europarl 9.15635E-4
news stories 9.12033E-4
statistical machine 9.101210000000001E-4
target collection 9.079559999999999E-4
weighting strategies 8.97156E-4
related work 8.91854E-4
results section 8.89006E-4
such applications 8.78895E-4
corpora 8.66542E-4
