parallel text 0.0025832299999999997
language data 0.00233536
parallel texts 0.002189939
parallel documents 0.00212183
language pairs 0.001982039
english word 0.001905394
machine translation 0.0018731440000000002
translation systems 0.001859795
language pair 0.001839134
text pairs 0.001833389
english text 0.0018109340000000002
text pair 0.001690484
parallel inputs 0.001686497
foreign language 0.001574463
language learning 0.001555765
candidate text 0.001545813
sentence level 0.00154284
input language 0.001540629
language exercise 0.001529698
training data 0.00152897
surprise language 0.0015287909999999998
particular language 0.001521551
text length 0.00150682
translation 0.00148685
such texts 0.001486517
word count 0.001477251
parallel 0.0014493
allel text 0.001427572
text genres 0.001415037
line search 0.001386443
text lengths 0.001380324
text genre 0.001375135
text doc 0.001372029
plain text 0.001372029
strand data 0.001362959
typical alignment 0.001312482
same pair 0.0013122160000000002
such corpora 0.001303856
alignment algorithms 0.0013030300000000002
space search 0.001285328
language 0.00128258
document pairs 0.001276673
search algorithm 0.0012231920000000001
strand corpus 0.001217224
such chain 0.001206507
search rectangle 0.001198593
sentence 0.00118997
test pairs 0.0011693390000000001
similarity score 0.001164779
comparable texts 0.001160572
suitable corpus 0.001151318
bitext map 0.001144555
texts simr 0.001136871
same length 0.0011285520000000001
search repeats 0.001125749
candidate pairs 0.001111342
same topic 0.001110874
bilingual dictionary 0.001103647
bitext mapping 0.001082956
english segment 0.001064574
last points 0.0010356200000000001
allel texts 0.001034281
same structure 0.001029745
alignment 0.0010169
training set 0.001016788
test set 0.001010478
same optimization 0.0010094050000000001
good bitext 0.000997529
same ideas 0.000996434
spurious points 0.000994706
bitext space 0.000993359
potential points 0.000990765
simr bitext 0.000990093
other approaches 0.000984611
strand documents 0.000982709
lel texts 0.000982641
unrelated texts 0.0009810819999999999
ble texts 0.000978345
other applications 0.000978284
candidate pair 0.0009684369999999999
first step 0.000966342
allel documents 0.000966172
different character 0.000963584
parameter set 0.0009563779999999999
training training 0.00095238
bipartite matching 0.000950328
large number 0.000948222
interior points 0.000943061
english side 0.000942483
corpus 0.000907045
other hand 0.000905566
search 0.00088583
document features 0.000881749
new application 0.000871085
optimal threshold 0.00086972
true bitext 0.000868192
common subsequence 0.000863889
new avenues 0.000856601
new parame 0.000856601
smt training 0.000853285
