word type 0.00268662
training data 0.0026393
model tagger 0.002420062
word types 0.002405666
data tokens 0.002322094
other word 0.002316763
annotation type 0.00225693
type annotation 0.00225693
markov model 0.002133777
model minimization 0.002112589
computational model 0.002103916
annotated data 0.002098057
test data 0.002079778
eng data 0.002057482
raw data 0.002051496
little data 0.002001566
word frequency 0.001989564
annotation types 0.001975976
data sources 0.001971081
mlg data 0.001969791
unannotated data 0.001966029
frequent word 0.001959059
data kinyarwanda 0.001958038
unannotated word 0.001951979
kin data 0.001937852
enough data 0.001937072
incomplete data 0.001926106
notated data 0.001924944
development data 0.001917884
data collection 0.001916962
pos tags 0.001858496
model 0.00184469
type information 0.00175242
token annotation 0.001747492
annotation time 0.001740333
pos labels 0.001722196
pos tagset 0.001679292
tag dictionary 0.001655048
pos task 0.001633239
ing pos 0.001628182
type annotations 0.001623179
high annotation 0.001623037
pos experiments 0.001610381
additional annotation 0.001578831
token type 0.001556722
time type 0.001549563
annotation studies 0.001548704
tagger training 0.001547082
target pos 0.001530524
previous pos 0.001524304
final pos 0.001523044
annotation efforts 0.001515941
between annotation 0.001514323
tional annotation 0.001513144
annotation mixture 0.001511579
realistic annotation 0.001508104
annotation effort 0.001496334
ing type 0.001492042
fixed annotation 0.001490554
annotation scenarios 0.001486184
training corpus 0.001485163
annotation budget 0.001478386
effective pos 0.001476961
annotation proportions 0.0014751
elapsed annotation 0.001473471
new tag 0.001465134
tial pos 0.001462447
different information 0.001461068
distinct pos 0.001436969
type anno 0.001429598
share pos 0.001417489
tag distributions 0.001409823
tokens types 0.00140663
natural language 0.001398034
type supervision 0.001390435
resource language 0.001384973
language processing 0.001369827
automatic tag 0.001369082
tag dic 0.001365182
tag dictionaries 0.001362381
noisy tag 0.001361443
initial tag 0.001355922
raw training 0.001355616
mlg type 0.001335281
likely tag 0.001331136
tag bigrams 0.001328169
training sequence 0.001326861
between type 0.001323553
rich language 0.001319578
kin type 0.001303342
respective language 0.001290506
approach learning 0.001289445
learning approach 0.001289445
type tok 0.001288239
tok type 0.001288239
type annota 0.001287376
type proportions 0.00128433
mixed type 0.001283215
type proportion 0.001281585
mixing type 0.001281585
