comparable corpus 0.002636979
bilingual corpora 0.00251417
different corpora 0.0024482799999999997
english word 0.0024424959999999997
corpus comparability 0.0024377419999999997
corpus vocabulary 0.002375235
parallel corpora 0.002336802
different words 0.00229279
corpus quality 0.002270013
homogeneous corpus 0.002215407
english words 0.0022006359999999997
external corpus 0.002196998
original corpus 0.002195613
press corpus 0.002191881
parable corpus 0.002170718
improved corpus 0.002167234
improving corpus 0.002165498
ternal corpus 0.002165498
comparable corpora 0.002164199
same data 0.002077871
such corpora 0.00200065
translation pairs 0.0019895679999999997
french words 0.00196341
word pair 0.001957736
english documents 0.001947718
new corpora 0.001911927
bilingual lexicon 0.001887838
corpus 0.00188084
other words 0.001839596
several corpora 0.001783866
word contexts 0.001779169
bilingual dictionary 0.00175753
bilingual document 0.001750921
homogeneous corpora 0.001742627
external corpora 0.001724218
french documents 0.001710492
unbalanced corpora 0.001693861
corpora yield 0.001692169
yields corpora 0.001692169
machine translation 0.001666914
different methods 0.0016295009999999998
correct translation 0.00162796
context vector 0.001598808
translation candi 0.001575293
different evaluation 0.001563703
different languages 0.001560947
english part 0.0015601690000000001
similarity measure 0.001559009
glish words 0.001552442
same number 0.001541527
lexicon extraction 0.0015191739999999999
similarity score 0.001496398
following similarity 0.001465821
bilingual clustering 0.001452072
english vocabulary 0.001442461
bilingual lexi 0.001440679
same vein 0.001434337
bilingual lexicons 0.001423587
corpora 0.00140806
bilingual dendrograms 0.001390121
different lan 0.0013523629999999999
clustering documents 0.001345614
document pairs 0.001344679
natural language 0.0013427299999999999
french part 0.001322943
language processing 0.001318868
parallel fragments 0.0013096079999999999
ing method 0.001303274
glish documents 0.001299524
translation 0.0012897
english parts 0.001271867
possible translations 0.001266371
cluster english 0.001258124
words 0.00125257
comparability measure 0.0012072300000000001
alternative method 0.001205296
french vocabulary 0.001205235
comparable cor 0.00119272
evaluation measure 0.001173811
method con 0.001167891
french dump 0.001158458
correct translations 0.001158007
context vectors 0.0011505159999999999
comparability score 0.001144619
previous work 0.001141558
gual lexicon 0.001138617
comparability scores 0.00113689
multiple translations 0.001135139
standard approach 0.00112704
possible document 0.001091435
first step 0.001081954
lexicon extrac 0.001075986
extraction experiments 0.001066233
con extraction 0.0010520640000000001
context vec 0.001046624
ing information 0.001046192
general approach 0.001027678
icon extraction 0.001022815
small part 0.001017756
language 0.00101531
