word tag 0.0028989000000000003
word tags 0.002735326
morphological data 0.00268753
word segmentation 0.0025603789999999998
single word 0.002542966
analyses word 0.002464798
word frequencies 0.002428025
original word 0.002418215
word categories 0.0023873979999999998
word category 0.002383669
actual word 0.002377409
word cate 0.00236916
different features 0.002338774
morphological analysis 0.002330397
morphological information 0.0022899269999999998
words approach 0.002120715
different feature 0.002106404
main features 0.002037392
example features 0.002027304
feature set 0.002018057
training data 0.001934979
morphological analyses 0.001919018
primary features 0.001917725
informative features 0.001901237
informative words 0.001875077
stop words 0.001855947
morphological informa 0.001843316
morphological analyser 0.001826326
morphological disambiguator 0.00182087
morphological analy 0.001817884
feature space 0.001794712
small feature 0.001770117
feature sets 0.001712296
model representation 0.001689494
space model 0.0016774519999999998
first tag 0.0016313389999999999
features 0.00162911
test data 0.00161344
words 0.00160295
several learning 0.001555347
pos tag 0.001535953
data representation 0.001531664
data sets 0.0014372060000000001
data size 0.0014329170000000001
annotated data 0.0014219670000000001
data sizes 0.001402683
feature 0.00139674
text classification 0.001388772
enough data 0.00138057
pos tags 0.001372379
language tasks 0.001343234
text documents 0.001316081
natural language 0.001294674
model 0.00127948
tive language 0.001273575
language identification 0.001257574
first characters 0.001256148
text categorization 0.001222871
text classi 0.001203033
different test 0.001201454
tag experiments 0.00119801
learning algorithms 0.001197374
different languages 0.001188508
syntactic information 0.001180251
different number 0.0011795870000000002
different classifiers 0.0011750950000000001
learning curves 0.001174442
stem tag 0.001169806
text files 0.001165637
ish text 0.001159757
few training 0.001149907
first setting 0.001136771
cal analysis 0.00113447
pos tagging 0.001125804
training size 0.001124596
other level 0.0011211300000000001
logical analysis 0.001118937
test set 0.001113107
first derivation 0.001109435
best first 0.0010985259999999998
first syllable 0.0010985259999999998
different fea 0.001097944
common analysis 0.001092592
correct analysis 0.001090741
different prefix 0.001084862
different settings 0.001078165
training sam 0.001075489
speech tags 0.001075344
training files 0.0010722610000000001
analysis process 0.001072115
training instances 0.001068777
classification task 0.001067077
ical information 0.001037774
document classification 0.001034905
extra information 0.0010324449999999999
information retrieval 0.001025003
character prefix 0.001013901
stem tags 0.0010062320000000001
different dimension 0.0010055560000000001
tags figure 0.0010038220000000001
