unknown word 0.0027850600000000002
new words 0.002758385
current word 0.002748052
word prediction 0.002729703
other words 0.002654073
known word 0.0026112260000000003
word predictor 0.002596277
word endings 0.0025954430000000002
unknown words 0.0023476
problem words 0.0023120709999999997
known words 0.0021737659999999997
training data 0.001967272
words 0.0018796
statistical data 0.001614225
training corpus 0.001604395
morphological process 0.001545
hmm model 0.001501384
separate data 0.001495841
morphological infor 0.0014077360000000001
training set 0.0013906460000000002
markov model 0.001389541
correct tag 0.001377475
current tag 0.001374662
full tag 0.00136563
tag distribution 0.001351993
tag occurrences 0.0013142219999999999
suffix information 0.0013012269999999999
new knowledge 0.0012823399999999999
hmm tagger 0.001280741
other methods 0.00126487
feature information 0.001258916
baseline method 0.0012477439999999998
tag distributions 0.001246764
lexical probabilities 0.0012314679999999999
tag distri 0.001221072
tagging system 0.001216902
linguistic information 0.001205952
hmm tagging 0.001202407
tagging systems 0.0012014719999999999
test set 0.00119122
new dis 0.001160101
brown corpus 0.001154342
speech tagging 0.001147427
specific information 0.0011409089999999998
baseline system 0.001133808
full tagger 0.0011242119999999999
method open 0.001099295
other source 0.001096773
information sources 0.001094373
certain features 0.001079548
other hand 0.001071741
overall performance 0.001070159
entire training 0.0010352410000000001
current systems 0.001008546
full hmm 0.001000449
possible tags 9.993509999999999E-4
proved results 9.99031E-4
simple set 9.93289E-4
heuristic method 9.85663E-4
contextual probabilities 9.723399999999999E-4
probability distribution 9.411929999999999E-4
overall distribution 9.396669999999999E-4
common approach 9.318390000000001E-4
model 9.22895E-4
example set 9.20532E-4
various methods 9.08957E-4
statistical methods 8.895819999999999E-4
rough stem 8.88724E-4
ion table 8.76026E-4
modular approach 8.72793E-4
corpus 8.52163E-4
language 8.49074E-4
test files 8.40016E-4
only suffix 8.361919999999999E-4
likely sequence 8.28522E-4
several techniques 8.2366E-4
prefix distribution 8.2316E-4
test run 8.22403E-4
probability dis 8.14186E-4
markov assumption 8.11218E-4
english affixes 8.11059E-4
probability distri 8.102719999999999E-4
partial parsing 8.07131E-4
suffix distributions 8.047390000000001E-4
speech predictor 8.02726E-4
many others 8.00165E-4
information 7.99582E-4
suffix informa 7.93346E-4
smooth ing 7.922820000000001E-4
ther work 7.90557E-4
suffix dis 7.82961E-4
further work 7.82417E-4
sible tags 7.81063E-4
suffix distribu 7.7234E-4
following equation 7.63435E-4
tistical methods 7.61842E-4
hidden markov 7.5979E-4
training 7.52232E-4
important area 7.483990000000001E-4
initial experiments 7.40877E-4
