document clustering 0.00211273
same language 0.002001376
document features 0.001881956
different features 0.001842401
same cluster 0.0017813759999999999
multilingual clustering 0.0017613759999999998
features translation 0.001733961
clustering algorithm 0.0017216760000000001
clustering results 0.00172065
document word 0.0016908280000000001
multilingual cluster 0.0016820659999999999
multilingual document 0.001660486
feature translation 0.001652103
multilingual corpus 0.001553647
cluster similarity 0.0015417389999999999
anchor language 0.001512818
text features 0.001508572
monolingual clustering 0.001507697
news cluster 0.001498646
translation approach 0.001483752
chor language 0.001476491
translation system 0.001474385
human clustering 0.0014681709999999999
different corpora 0.0014591629999999999
different languages 0.001452205
several document 0.001430795
clustering steps 0.001423904
machine translation 0.001422511
clustering solution 0.00141825
manual clustering 0.001392131
independent clustering 0.00138416
document rep 0.001380651
gual clustering 0.0013586989999999999
multilingual documents 0.001352547
translation process 0.0013495260000000002
clustering phase 0.001347911
document frequency 0.001341987
multilingual clusters 0.001337813
translation systems 0.0013287470000000001
final cluster 0.001304579
whole document 0.001302867
monolingual corpus 0.001299968
new features 0.00129858
results clusters 0.0012970870000000002
clusters results 0.0012970870000000002
parallel corpus 0.0012684
different type 0.001265181
free document 0.001262076
document represen 0.001260109
cluster adjustment 0.001258672
gual document 0.001257809
first clusters 0.001255932
language 0.0012475
same corpora 0.0012466740000000001
similar documents 0.001245884
document contents 0.001245049
tilingual document 0.001242008
different lan 0.001240176
other documents 0.0012295779999999998
different strategies 0.001207541
first approach 0.001198512
text representation 0.001197441
different solu 0.0011948689999999999
base features 0.001185017
comparable corpus 0.001180052
articles corpus 0.0011648820000000001
other approach 0.001157424
clusters news 0.001154393
ish corpus 0.001127261
multilingual news 0.001125712
extract features 0.00112566
proper translation 0.0011182190000000002
correct translation 0.001113419
same person 0.001112151
selected features 0.001107705
clustering 0.00110681
new clusters 0.001105791
vant features 0.001104631
same entity 0.0011031650000000001
same category 0.001099065
chine translation 0.001088023
translation tech 0.00108716
translation technolo 0.0010863560000000001
systran translation 0.0010863560000000001
monolingual clusters 0.001084134
multilingual classification 0.001053889
single word 0.001051353
semantic similarity 0.001041841
feature transla 0.001028121
cluster 0.0010275
feature selec 0.001023367
other languages 0.001017437
classification system 0.001015783
same procedure 0.001015485
total clusters 0.001012494
source word 0.001010871
document 0.00100592
multilingual resources 9.98455E-4
information retrieval 9.982910000000001E-4
knowledge representation 9.85409E-4
