parallel data 0.00301911
parallel web 0.002668575
mining parallel 0.002618309
parallel text 0.002479893
parallel corpus 0.002465588
parallel pair 0.002464274
parallel page 0.002457893
bilingual data 0.00239861
parallel document 0.002388785
tree alignment 0.00238418
parallel documents 0.002371803
parallel pages 0.002311347
parallel sentences 0.002282676
tree model 0.002213958
parallel corpora 0.002192854
language pairs 0.002182184
node translation 0.002149891
parallel dom 0.002148144
web data 0.002133925
parallel content 0.00213106
sentence alignment 0.002116507
alignment model 0.002110578
parallel sen 0.002085783
data mining 0.002083659
parallel hyperlinks 0.002063904
parallel hyper 0.002059466
chinese translation 0.002057556
scale parallel 0.00205029
bilingual web 0.002048075
parallel docu 0.002028572
text translation 0.002023183
act parallel 0.002020878
tree pairs 0.001952904
translation probability 0.001936484
tree alignments 0.001912901
different translation 0.001896437
document tree 0.001855685
alignment method 0.001807843
machine translation 0.001784228
parallel 0.00177688
chinese word 0.001761856
tree nodes 0.001757892
syntactic tree 0.00173901
web mining 0.001733124
language generation 0.001719541
sentence pairs 0.001685231
alignment models 0.001657505
translation sys 0.001646494
dom tree 0.001615044
bilingual websites 0.001605289
bilingual website 0.0015931
alignment score 0.001593055
tree substitution 0.001579852
web page 0.001572708
bilingual corpora 0.001572354
translation prob 0.001565228
data acquisition 0.001556968
tree align 0.001547313
tree similarities 0.001528908
optimal tree 0.001528745
data consortium 0.001525231
alignment performance 0.00151262
web document 0.0015036
synchronous tree 0.00148988
tree hierar 0.001487927
web documents 0.001486618
guistic data 0.001486441
language 0.00147306
mining system 0.001465753
alignment accuracy 0.001461629
source sentence 0.001450422
alignment task 0.001442309
web pages 0.001426162
tence alignment 0.001409217
mining process 0.001404311
page pairs 0.001390137
alignment configurations 0.001387696
alignment support 0.001387273
alignment configuration 0.001385395
new web 0.001384172
target node 0.001369147
page pair 0.001368407
mining results 0.001366315
new mining 0.001333906
length model 0.001329373
document pairs 0.001321029
translation 0.00132017
ment model 0.001305994
ibm model 0.001301382
document pair 0.001299299
verification model 0.001291797
sentence align 0.00127964
model the 0.001277955
output sentence 0.001275459
mining approach 0.001275324
previous mining 0.001275249
word breaker 0.00127028
sentence aligner 0.001269822
object model 0.001259676
candidate web 0.001258908
