human evaluation 0.00326824
translation quality 0.003087678
human translations 0.00296445
evaluation metrics 0.00291514
reference translation 0.002878383
good translation 0.002869316
machine translation 0.002809062
translation candidate 0.002542441
chine translation 0.002536467
translation problems 0.002529637
human judgments 0.002337577
combination metric 0.002201544
different metrics 0.002181984
bleu metrics 0.002181715
human reading 0.002171226
metric types 0.002162874
translation 0.00215912
human ratings 0.002158285
semantic analysis 0.002142585
ensemble metric 0.00211315
superior metric 0.002103409
metric beats 0.002092577
semantic relatedness 0.001951456
semantic equiv 0.001951456
semantic compatibility 0.001951456
reference translations 0.001907273
training data 0.001886387
constant evaluation 0.001861626
regression model 0.001748959
metric 0.00171947
linguistic information 0.001711722
feature set 0.001673056
language sentences 0.001672537
linguistic analysis 0.001614465
rte features 0.001570958
lexical similarity 0.001559103
linguistic variation 0.001543011
entailment entailment 0.00153154
system level 0.001524487
entailment figure 0.001522789
research projects 0.001521854
evaluation 0.0014918
mismatch features 0.001491775
quality judgments 0.001489695
rte feature 0.001479848
traditional features 0.001477787
complementary features 0.001475344
automatic measures 0.001467928
language pairs 0.00145944
linguistic representations 0.001451891
feature combination 0.001450044
labeled data 0.001433496
linguistic analyses 0.001432994
linguistic evidence 0.001425211
metrics 0.00142334
data sparsity 0.001416002
individual words 0.001388122
linear regression 0.001383906
stochastic model 0.001372254
human quality 0.001370393
feature computation 0.001360857
large bleu 0.001351803
quality prediction 0.00134147
entailment relations 0.00134113
word order 0.001333443
system hypothesis 0.001327526
rte system 0.001320909
quality gap 0.001299578
natural language 0.001286552
logical entailment 0.001218781
main reason 0.00120151
individual sentences 0.0012
textual entailment 0.001192906
translations 0.00118801
entailment recognizer 0.001162213
stanford entailment 0.001162213
different settings 0.001157089
bidirectional entailment 0.001151926
rich set 0.001140841
entailment status 0.001140305
entailment hyp 0.001140305
ture set 0.001137482
bleu differences 0.001134275
identical sentences 0.001134028
assess meaning 0.001131379
individual scores 0.001105082
modelling level 0.001089595
good predictors 0.001085603
traditional scores 0.001082986
rte task 0.001079199
same thing 0.001072232
research 0.00106291
features 0.00105908
art scores 0.001048814
phrase reorderings 0.001035207
novel approach 0.001011103
argument structure 0.001008562
model 0.00100125
analysis india 0.000980861
small amounts 0.000974466
