training word 0.0027003699999999997
training data 0.00237659
word class 0.0023662379999999997
same word 0.002041337
new word 0.002041322
word list 0.002019777
word segmentation 0.0020057869999999998
word level 0.001942442
word labels 0.001922804
independent word 0.0019193959999999999
word types 0.001913513
class training 0.001904228
such training 0.001901919
seed word 0.0018801089999999998
word boundaries 0.0018542239999999998
training words 0.001831509
allowable word 0.001825675
word separators 0.0018219979999999998
optional word 0.0018204529999999999
training information 0.001796078
large data 0.001772673
same data 0.001717557
training classification 0.001712657
training set 0.001711918
test data 0.0016937150000000002
training names 0.001680471
different training 0.001675046
ing data 0.001661809
test text 0.0016029249999999998
training list 0.001557767
seed data 0.001556329
additional training 0.001541316
other words 0.001531458
data structures 0.0015262140000000001
right context 0.00151827
data structure 0.0015179
small training 0.001512447
learning system 0.001509421
system classification 0.0015063860000000002
flexible data 0.001504151
data sets 0.00150409
total training 0.001499149
raw text 0.001498264
baseline model 0.001493401
data files 0.001489794
hindi data 0.001481639
following model 0.001474643
romanian text 0.001467503
entity class 0.001467348
particular context 0.0014604549999999998
such information 0.001459637
left context 0.001448373
text paths 0.001445924
training wordlist 0.001443375
original training 0.001438826
context source 0.00143644
initial training 0.0014312819999999999
context tries 0.00142933
training examples 0.001427989
text tries 0.00142765
seed training 0.001418099
training seed 0.001418099
default model 0.00140778
unannotated text 0.001400372
context trie 0.001397104
tokenized text 0.001393238
training texts 0.001390856
hindi text 0.001390849
context hey 0.0013906909999999999
context links 0.0013906909999999999
text acquisition 0.001389288
text size 0.001389288
general model 0.001387269
training file 0.001381985
other names 0.00138042
training resources 0.001378863
system performance 0.00136902
sample training 0.001367325
unannotated training 0.001352932
actual training 0.001352746
training wordlists 0.001346933
training seeds 0.001340701
system training 0.001340701
morphological information 0.0013382020000000001
segmentation system 0.001337506
core model 0.0013330500000000001
model combination 0.00133074
first class 0.0013257149999999999
language processing 0.001294211
other languages 0.001284362
entity classification 0.001275777
independent system 0.0012511150000000001
other node 0.001248355
name class 0.001247039
same class 0.001245195
new class 0.00124518
romanian language 0.001240669
natural language 0.0012401439999999999
accuracy system 0.001232564
default system 0.0012114790000000001
