different word 0.004811522
word test 0.004724248
word form 0.004560679
context word 0.004491532
preceding word 0.004393138
target word 0.004371563
word part 0.00436481
certain word 0.004331328000000001
unambiguous word 0.004330837000000001
word forms 0.004301578
word tokens 0.004290451
word sequences 0.004289175
foreign word 0.004284073
lion word 0.00427388
unknown words 0.002027051
context words 0.002002822
certain words 0.001842618
lexical rule 0.0018156320000000002
known words 0.001797914
training data 0.00177537
lexical rules 0.001650211
training corpus 0.00162431
first rule 0.0016182520000000001
test data 0.001608613
possible rule 0.001595094
words 0.00153902
same data 0.001523133
training text 0.001516584
rule types 0.0015032790000000002
new rules 0.0014624960000000002
second rule 0.0014592020000000002
test corpus 0.0014575529999999999
different tags 0.001455112
important rules 0.00144225
different text 0.001437101
third rule 0.0014189600000000001
corpus text 0.001414344
rule states 0.001407732
rule deletes 0.0014040390000000002
ion rules 0.001319463
tag set 0.00131284
learning system 0.001297787
bad rules 0.001277415
different set 0.001262669
speech tag 0.0012551349999999999
barrier rules 0.001240924
elimination rules 0.001240433
lexicai rules 0.001238537
isambiguation rules 0.001238537
specific tag 0.0012199519999999998
tag sequence 0.001187356
target tag 0.001177796
unseen data 0.001160905
data files 0.001159045
syntactic structure 0.001150126
annotated training 0.001144737
unambiguous tag 0.00113707
training material 0.0011255520000000001
different part 0.001120872
morphological features 0.00111948
ing system 0.001111193
gol system 0.001099121
progol system 0.001098618
speech tags 0.001092492
tag elimination 0.001082897
different ways 0.001079411
ilp system 0.001067925
english text 0.001058104
sentational language 0.00105266
different kinds 0.001031319
syntactic grammar 0.0010197539999999999
text categories 0.001012126
brown corpus 0.0010116819999999999
grammatical information 0.001001153
rules 9.91499E-4
syntactic depen 9.76462E-4
other respects 9.72769E-4
same time 9.665940000000001E-4
inary test 9.42971E-4
same tagset 9.35616E-4
unrestricted text 9.09636E-4
text genres 9.0527E-4
swedish text 9.03562E-4
speech tagging 9.03079E-4
tagging process 8.957489999999999E-4
correct reading 8.747799999999999E-4
training 8.63275E-4
several types 8.605870000000001E-4
present work 8.37638E-4
current work 8.202350000000001E-4
system 8.10152E-4
background knowledge 8.09445E-4
alent number 8.07333E-4
language 8.04556E-4
noise level 7.996769999999999E-4
machine learning 7.95926E-4
brill tagger 7.83894E-4
several years 7.8019E-4
speech categories 7.79989E-4
suc tagging 7.70022E-4
