training data 0.002233432
language data 0.002053484
data set 0.001969148
feature test 0.001955466
feature value 0.001864105
other features 0.0018198099999999998
full data 0.001770426
feature space 0.0017518030000000001
base data 0.001731916
annotated data 0.001727249
feature selection 0.00171178
optimal feature 0.001696097
unseen data 0.001655936
feature generation 0.0016058819999999999
feature weighting 0.001605208
dimensional feature 0.001596342
binary feature 0.001583573
feature construction 0.001582833
ratio feature 0.001579808
feature range 0.0015720719999999999
word class 0.0015711990000000001
feature selec 0.001564034
numerical feature 0.001559694
float feature 0.00155749
phrasometer feature 0.00155749
word form 0.001531842
learning algorithm 0.001464355
function word 0.0014630280000000001
input features 0.001428702
shallow features 0.0014168549999999999
word forms 0.001378794
numerical features 0.001373884
feature 0.00135308
training set 0.00131006
test corpus 0.001296809
pos information 0.001269823
word distinction 0.001258239
required word 0.001258239
same corpus 0.00124543
function words 0.001204225
features 0.00116727
machine learning 0.001133702
learning algorithms 0.001131834
classifier set 0.00111274
training instances 0.0010934060000000001
current training 0.001086878
certain words 0.001076986
classification task 0.001059475
learning algo 0.001041551
content words 0.0010361910000000001
available training 0.001032961
inductive learning 0.001025287
lazy learning 0.0010226150000000002
head words 0.001010224
ing set 0.001007993
training material 0.001007771
half words 9.99611E-4
training partition 9.91498E-4
discourse information 9.8309E-4
information content 9.80447E-4
annotated corpus 9.754119999999999E-4
good algorithm 9.65844E-4
information field 9.60851E-4
crucial information 9.446999999999999E-4
pos tagging 9.41028E-4
joint corpus 9.33723E-4
classifier parameters 9.33503E-4
task combination 9.249670000000001E-4
ing pitch 9.19566E-4
punctuation baseline 9.187189999999999E-4
distance classifier 9.183679999999999E-4
ilk corpus 9.167590000000001E-4
corpus material 9.15022E-4
test instances 9.0862E-4
eindhoven corpus 8.9916E-4
same time 8.80542E-4
present task 8.75921E-4
time system 8.729199999999999E-4
prosodic pitch 8.69038E-4
test sets 8.66528E-4
combined task 8.58627E-4
other points 8.58135E-4
building algorithm 8.57442E-4
level breaks 8.546420000000001E-4
full pos 8.542860000000001E-4
same size 8.45147E-4
task definition 8.44637E-4
fication task 8.42027E-4
value frequency 8.38827E-4
classifier settings 8.34981E-4
classification accuracy 8.349119999999999E-4
binary test 8.32879E-4
neighbour classifier 8.2109E-4
pos tags 8.20772E-4
independent test 8.20233E-4
tts system 8.20103E-4
learning 8.18702E-4
prosodic boundary 8.17939E-4
prosodic annotation 8.11286E-4
dutch text 8.10982E-4
