oov word 0.003201488
same word 0.003056346
dictionary word 0.002772856
word alignment 0.00269958
inv word 0.002671341
oov words 0.002470318
translation arabic 0.002127862
training data 0.002085271
inv words 0.001940171
test data 0.001901364
different oov 0.001829528
morphological features 0.001788178
arabic news 0.001764176
linguistic data 0.00168376
words 0.00168231
arabic tokens 0.001663401
data sets 0.001600266
language model 0.001586003
baseline phrase 0.001563508
arabic linguistic 0.00155841
data sparsity 0.00154388
mteval data 0.001536793
data consortium 0.001525349
error training 0.001470274
different techniques 0.001462833
arabic script 0.001455617
baseline system 0.001453612
different spelling 0.001437236
arabic entries 0.001433567
buckwalter arabic 0.001421677
arabic treebank 0.001419961
arabic tokenizer 0.001398785
different letters 0.001397635
inflectional features 0.001374619
source language 0.001371851
parallel text 0.001363322
baseline bleu 0.00135862
english phrase 0.001356745
different tokenization 0.001348716
large set 0.001322576
translation pair 0.001316794
logical features 0.001315283
different issues 0.001308439
different sen 0.001306103
same english 0.001288524
correct translation 0.001285155
machine translation 0.001282695
oov tokens 0.001275559
target language 0.001268614
different origins 0.001264811
translation prob 0.00125171
translation probabilities 0.001247582
low translation 0.001245809
rich language 0.001199095
error analysis 0.001179057
arabic 0.00117585
token oov 0.001165087
new phrase 0.00116244
trigram language 0.001153045
english tokens 0.001133209
language morpho 0.001123158
language unigrams 0.001114223
oov handling 0.001092513
phrase table 0.001089444
morphological oovs 0.001083517
erage baseline 0.00108132
morphological preprocessing 0.001076754
oov instances 0.001076136
other languages 0.001075387
oov cases 0.001074746
features 0.00107171
online oov 0.001071094
oov rate 0.001055505
english transliteration 0.00104798
different 0.00104152
english smt 0.001039926
possible translations 0.001031486
english corpora 0.001026197
morphological variations 0.00102325
oov rates 0.001017165
ticipate oov 0.001009787
english preprocessing 0.001005944
transliteration system 0.001003513
morphological variant 0.001000056
morphological matching 0.000997485
smt system 0.000995459
morphological variants 0.000986538
morphological expansion 0.000983101
bleu results 0.000980663
inv phrase 0.000968948
original phrase 0.000960718
phrase tables 0.000959684
morphological inflection 0.000958768
other approaches 0.000953779
rule set 0.00095203
translation 0.000952012
morphological dis 0.000949625
large list 0.000925492
english transla 0.000921013
raw text 0.000919774
