word training 0.0017823920000000003
training data 0.001737882
rule system 0.001683331
test data 0.00164749
word form 0.00156608
data set 0.0015480679999999999
same data 0.001519812
large data 0.001515953
input word 0.0014902740000000002
possible tag 0.001484591
word order 0.001441137
small data 0.0014348579999999998
statistical system 0.001434335
data size 0.0014255129999999998
lexical information 0.001411018
rule component 0.001408224
data problem 0.0014059349999999999
different taggers 0.001402879
statistical tagger 0.001401279
tag language 0.00138739
ambiguous word 0.001377729
total data 0.0013695019999999999
morphological ambiguity 0.001368883
language model 0.001356428
such rules 0.001351937
word forms 0.001317646
original system 0.001315711
lexical model 0.001311904
word conditioning 0.001311784
free word 0.001311754
disambiguation rules 0.001309477
word sense 0.001308001
evaluation method 0.001305392
word bigrams 0.001304004
manual rule 0.001302291
data sparseness 0.0012861029999999998
morphological analyzer 0.001262649
single rule 0.001261873
immediate data 0.001258815
sparse data 0.001258815
heldout data 0.001258815
general rules 0.0012579590000000001
possible tags 0.001256288
hmm tag 0.00123572
manual rules 0.001234559
correct tag 0.001228491
reliable rules 0.001222743
tagger performance 0.001214343
next rule 0.001210857
morphological dictionary 0.00118663
tag sequence 0.00118268
hmm tagger 0.001178722
whole rule 0.001169858
morphological processor 0.0011620419999999998
statistical component 0.001159228
statistical learning 0.001158768
rule compo 0.001151783
rule development 0.001145436
rules combined 0.001141538
small training 0.00113772
czech tag 0.001134341
tagger table 0.001123108
english tag 0.0011127659999999998
corpus material 0.0011096819999999999
tagging results 0.001104642
statistical tagging 0.001093635
wrong tag 0.0010852449999999999
statistical hmm 0.001085153
final evaluation 0.001082447
same time 0.001082301
czech tagger 0.0010773430000000001
available tagger 0.00107671
testing corpus 0.001070845
first time 0.001061417
combined tagger 0.001053843
ing results 0.001052435
tag trigrams 0.001048643
lexical component 0.001043817
relative frequency 0.001035628
ical model 0.001031501
annotator tagger 0.00103126
tagger might 0.001028789
hybrid system 0.001024839
statistical classifier 0.0010169839999999999
sentence error 0.001011854
same form 0.001006362
tagger male 0.0010017329999999999
enough training 9.811490000000002E-4
ing precision 9.72897E-4
such ambiguity 9.65905E-4
test sets 9.64714E-4
single sentence 9.64599E-4
ing tags 9.63692E-4
full disambiguation 9.63039E-4
machine learning 9.55424E-4
input text 9.534389999999999E-4
statistical classifiers 9.51691E-4
such systems 9.442690000000001E-4
czech sentence 9.354960000000001E-4
important task 9.28835E-4
