unknown word 0.003180063
unknown words 0.003015563
statistical model 0.002874453
tags words 0.0028459210000000004
other words 0.002830988
new word 0.0027911589999999997
word tagging 0.002787731
chinese word 0.0027701939999999997
model figure 0.002749507
word distribution 0.0027410959999999997
entropy model 0.002738165
model the 0.002686212
hybrid model 0.0026780479999999997
trigram model 0.00267134
word order 0.002651713
individual model 0.002651712
combined model 0.0026279199999999997
model guesses 0.002620569
word tokens 0.0026108449999999997
tistical model 0.002609987
free word 0.0026003669999999997
word hav 0.002599059
word identification 0.002599059
word abc 0.002599059
word resolution 0.002599059
word delimiters 0.002599059
model comple 0.002596906
component words 0.002563024
rare words 0.002489966
monosyllabic words 0.0024753180000000002
words xiaofei 0.002473739
known words 0.002467987
loan words 0.0024593650000000002
disyllabic words 0.002450579
syllabic words 0.002443989
words chars 0.002439358
trisyllabic words 0.002435874
align words 0.002435272
reduplicated words 0.0024344930000000002
model 0.00232578
words 0.00216407
unknown pos 0.0017554430000000002
pos information 0.001606246
other models 0.00159012
pos tags 0.001585801
such tags 0.001517937
pos tag 0.0014888140000000002
statistical models 0.001471875
training corpus 0.0014368039999999999
training data 0.001418654
data test 0.0014072239999999999
test data 0.0014072239999999999
different models 0.0013964070000000001
information structure 0.00135052
character strings 0.001333818
pos context 0.0013225070000000001
pos categories 0.0013074010000000001
previous pos 0.001301627
hybrid models 0.00127547
learning information 0.0012712510000000002
pos category 0.001267681
pos probabilities 0.001266212
second language 0.0012642550000000002
training test 0.001259834
particular pos 0.00125426
individual models 0.001249134
data types 0.001246115
chinese corpus 0.001242796
tistical models 0.001207409
ing data 0.001196764
sign language 0.001196698
dividual models 0.001195295
pos guess 0.001194115
pos cate 0.001188033
unknown nouns 0.001184143
pos cat 0.0011794610000000001
pos guessing 0.001174871
natural language 0.001145786
rule tags 0.001144613
other types 0.001130011
noun morphemes 0.001112183
separate set 0.001098951
language thamar 0.001098388
language question 0.001094776
noun morpheme 0.001082957
nese corpus 0.00108044
language generation 0.001075274
language generator 0.001068731
chars data 0.00105831
speech recognition 0.001053989
internal structure 0.001045979
useful information 0.0010423160000000002
noun figure 0.001034654
complete set 0.001001756
training lexicon 0.001001579
contextual information 9.9914E-4
semantic information 9.81921E-4
different segmentation 9.6628E-4
total training 9.638260000000001E-4
student workshop 9.58586E-4
