translation similarity 0.003029478
translation words 0.002783481
translation word 0.002775924
bilingual language 0.00270507
language model 0.002517288
parallel web 0.00245044
language pairs 0.002433363
machine translation 0.002405513
external translation 0.002352182
internal translation 0.002342433
final translation 0.002326791
bilingual web 0.00231611
translation simi 0.002298873
hanced translation 0.002287767
same language 0.002280741
other language 0.002279973
different language 0.002279802
web page 0.002247063
english page 0.0022291430000000003
parallel page 0.0021194630000000002
chinese page 0.002111903
language identification 0.002061703
translation 0.00205455
web mining 0.0020273
web pages 0.002009428
english pages 0.001991508
bilingual page 0.001985133
natural language 0.001979617
language processing 0.001960288
language versions 0.001952321
parallel pairs 0.001916803
language inde 0.001912872
mining parallel 0.0018997
different web 0.0018908420000000002
bilingual data 0.0018881800000000002
parallel pages 0.0018818279999999999
chinese pages 0.0018742680000000001
web documents 0.0018430780000000002
english document 0.001840895
parallel sentence 0.001823785
chinese document 0.0017236550000000002
parallel documents 0.0017154779999999999
page pairs 0.001713426
language 0.00167798
monolingual web 0.0016779400000000002
government web 0.0016489590000000002
candidate page 0.001648601
parallel sentences 0.001634029
parallel corpora 0.001614756
web site 0.001590559
effective web 0.0015905140000000001
typical web 0.00156418
parallel resources 0.001560828
multilingual web 0.0015518720000000001
lingual web 0.00153959
web sites 0.001533936
web min 0.0015229940000000002
web spider 0.0015205260000000001
page pair 0.001512747
english homepage 0.0015054110000000001
words pairs 0.001484314
bilingual websites 0.001476875
word pairs 0.001476757
page set 0.00147441
parallel resource 0.001459196
pages mining 0.001458688
other information 0.001448989
candidate pairs 0.001445941
bilingual website 0.001434865
bilingual resources 0.001426498
sentence pairs 0.001417748
url similarity 0.001403818
parallel homepages 0.001394911
chinese homepage 0.0013881710000000001
bilingual dictionary 0.001387652
chinese homepages 0.001387351
similarity values 0.001382038
bilingual lexicon 0.001367087
structure similarity 0.001345902
similarity measures 0.001341724
mining system 0.001340529
page structure 0.001329017
tion similarity 0.001328786
document pairs 0.001325178
bilingual resource 0.001324866
structural similarity 0.001300488
page weight 0.0012998810000000001
pair mining 0.001292984
page size 0.001281843
bilingual lexicons 0.001278536
similarity score 0.001276373
top page 0.001275086
external similarity 0.00127256
english 0.0012711
error page 0.001265935
internal similarity 0.001262811
certain page 0.001260055
bilingual lexi 0.001259554
final similarity 0.001247169
similarity input 0.001240258
