arabic word 0.0036938
arabic text 0.002393016
word segmentation 0.002386018
standard arabic 0.0023740230000000003
model training 0.002362792
segmentation model 0.002341708
arabic corpus 0.00231457
dialectal arabic 0.0022778720000000002
news arabic 0.002270383
arabic letter 0.002221763
feature set 0.002199874
multiple word 0.002197653
current word 0.00219082
word length 0.00217944
arabic segmenter 0.002175092
model output 0.002159751
egyptian arabic 0.002140882
arabic orthography 0.002140662
arabic treebank 0.002139198
arabic nlp 0.0021330060000000002
crf model 0.002120588
word seg 0.002113091
arabic clitics 0.002109111
arabic segmenta 0.002104963
bic word 0.002101753
arabic parsing 0.002098928
tal arabic 0.002096544
word senses 0.0020871740000000002
arabic dialects 0.002082913
informal arabic 0.002072902
stanford word 0.002070403
single model 0.002069997
word segmenters 0.002069555
levantine arabic 0.002069043
arabic ortho 0.002069043
maghrebi arabic 0.002069043
arabic construction 0.002069043
final model 0.002053982
additional feature 0.00203928
simple feature 0.002034173
denero model 0.002029178
augmented model 0.002025134
feature space 0.0019349439999999999
additional features 0.0019192599999999999
original feature 0.001866683
arabic 0.00184679
model 0.0018027
indicator feature 0.001783411
feature templates 0.001782582
feature map 0.0017767549999999999
new features 0.001772651
independent features 0.001695056
indicator features 0.001663391
ditional features 0.00166274
feature 0.00155358
features 0.00143356
foreign words 0.001411178
training data 0.001385272
morphological analyzer 0.001374067
morphological richness 0.001298345
dialectal data 0.001256262
error analysis 0.001248238
set results 0.001231088
dialect data 0.001229654
gold data 0.001191571
data atb 0.001176713
words 0.0011691
annotated data 0.001164635
character class 0.001121501
treebank data 0.001117588
set errors 0.0011154099999999998
test set 0.0011148299999999998
development data 0.001110219
error categories 0.0010929919999999999
other types 0.0010790630000000001
available data 0.001069374
typographical error 0.001067605
current character 0.001045912
other systems 9.91259E-4
other interactions 9.90226E-4
results table 9.86904E-4
dialectal text 9.77308E-4
label space 9.749399999999999E-4
machine translation 9.649200000000001E-4
unicode character 9.61455E-4
natural language 9.598250000000001E-4
other ambiguities 9.545690000000001E-4
original system 9.43023E-4
same split 9.380460000000001E-4
development set 9.313329999999999E-4
language processing 9.216690000000001E-4
segmented sentence 9.094610000000001E-4
additional context 9.06181E-4
correct analysis 9.04699E-4
gold standard 8.93624E-4
current text 8.900360000000001E-4
surrounding sentence 8.74823E-4
clitic segmentation 8.747760000000001E-4
joint segmentation 8.74562E-4
simple domain 8.69515E-4
