word character 0.00668081
word model 0.00620651
other word 0.005323515
word segmentation 0.00523769
japanese word 0.005234133
unknown word 0.005214634
word context 0.005034423
accuracy word 0.005023396
word length 0.004936601
previous word 0.00492731
word type 0.004914542
word spelling 0.004912153
word models 0.004908598
sentences word 0.004865444
word problem 0.00484644
above word 0.004831971
word types 0.004829656
word sequence 0.004814767
tags word 0.004804323
word tags 0.004804323
word char 0.004784243
word mode 0.004775619
dictionary word 0.00475903
word bigram 0.004753525
word ngram 0.004744919
word morphology 0.004744027
word tokens 0.004743886
empirical word 0.004736397
average word 0.004732545
known word 0.00472933
word bigrams 0.004722093
distinct word 0.004719165
word segmentat 0.00470986
word symbol 0.004706234
word seg 0.004705679
word mod 0.004705665
word segmenta 0.004704798
word segmenter 0.004700661
word lengths 0.004691974
words character 0.0043814
japanese character 0.003164903
speech character 0.003056243
other words 0.003024105
test words 0.00295478
japanese words 0.002934723
character set 0.002920488
unknown words 0.002915224
different character 0.002896868
character length 0.002867371
character type 0.002845312
character string 0.002799578
character types 0.002760426
character sequence 0.002745537
chinese character 0.002744174
single character 0.00274087
common character 0.002736297
character sets 0.002719102
segmentation model 0.00269416
character bigram 0.002684295
fixed character 0.002669
character unigram 0.00265962
character bigrams 0.002652863
distinct character 0.002649935
words length 0.002637191
character perplexity 0.002635884
character zerogram 0.002628685
character unigrams 0.002626153
tinct character 0.002623664
ent character 0.002623664
character perplex 0.002623664
language model 0.002620109
long words 0.002537431
function words 0.002525525
katakana words 0.002484149
first model 0.002457021
hapax words 0.002444046
frequent words 0.002437598
origin words 0.002434871
known words 0.00242992
kanji words 0.002423107
loan words 0.002420518
words rec 0.002420103
distinct words 0.002419755
foreign words 0.002417827
compound words 0.002398766
infrequent words 0.002398468
length model 0.002393071
quent words 0.002392582
spelling model 0.002368623
character 0.00230579
second model 0.002295397
baseline model 0.002273572
statistical model 0.002269011
markov model 0.002231083
bigram model 0.002209995
third model 0.002205929
brown model 0.002188909
unigram model 0.00218532
guage model 0.002165723
separate model 0.002148838
