word segmentation 0.002503651
query retrieval 0.0022848
segmentation retrieval 0.002063741
simple word 0.002061053
many words 0.00199492
chinese words 0.0019827539999999998
information retrieval 0.00195368
accurate word 0.001892869
short word 0.001891874
few words 0.0018149099999999999
common words 0.001806437
text segmentation 0.001762853
initial retrieval 0.001748854
short words 0.001726544
character words 0.001719639
retrieval results 0.00171921
retrieval algorithm 0.001706793
crucial words 0.001691677
initial query 0.001674154
query term 0.001665335
query expansion 0.00163998
segmentation method 0.001579623
good retrieval 0.001569965
trec query 0.001562327
second retrieval 0.001511755
words 0.00145433
many segmentation 0.0014245809999999999
retrieval effectiveness 0.00141724
powerful retrieval 0.0014102960000000001
admirable retrieval 0.0014087470000000001
retrieval conference 0.001405671
initial lexicon 0.0013890320000000001
raw query 0.001372903
chinese information 0.001302354
chinese language 0.001297768
other cases 0.00129224
other analysis 0.001286505
other processing 0.001229112
simple language 0.001210737
lexicon size 0.001187104
segmentation patterns 0.001186975
further segmentation 0.001183253
retrieval 0.00117975
other applications 0.0011693340000000002
other tools 0.00116766
small lexicon 0.001162239
segmentation purposes 0.00115915
accurate segmentation 0.0011572
different number 0.001153743
wrong segmentation 0.001141934
machine translation 0.001140468
final lexicon 0.001139183
other researchers 0.001136367
lexicon list 0.0011267220000000001
common language 0.001121451
chinese collection 0.00111068
segmentation marker 0.001109218
free text 0.001107358
query 0.00110505
lexicon effects 0.0010881040000000001
language usage 0.001082969
lexicon lookup 0.001055436
average precision 0.001039889
trec evaluation 0.001015751
relevant documents 0.001015474
test corpus 0.001012898
document collection 0.001012252
same characters 0.001010845
usage rules 0.001009259
document term 9.90281E-4
congress rule 9.71379E-4
english content 9.64075E-4
bigram method 9.520749999999999E-4
igent rule 9.41825E-4
mistake rule 9.3745E-4
good results 9.29675E-4
many cases 9.26897E-4
system stopwords 8.85657E-4
long documents 8.84838E-4
segmentation 8.83991E-4
term weights 8.741409999999999E-4
term usage 8.7391E-4
trec experiments 8.628170000000001E-4
first step 8.62177E-4
new terms 8.61669E-4
chinese experiment 8.49853E-4
original trec 8.48779E-4
content terms 8.431770000000001E-4
dictionary entries 8.414000000000001E-4
pircs system 8.39314E-4
lexicon 8.19928E-4
chinese topics 8.194809999999999E-4
small set 8.16263E-4
many purposes 8.157489999999999E-4
translation 8.06075E-4
first pass 8.05115E-4
short queries 8.046010000000001E-4
experimental results 7.91288E-4
global term 7.86438E-4
single characters 7.77715E-4
