arabic word 0.00371123
arabic words 0.00295123
arabic text 0.002431838
arabic corpus 0.002361017
word segmentation 0.002347923
feature set 0.002311853
standard arabic 0.002311695
news arabic 0.00230958
dialectal arabic 0.002268637
current word 0.002232462
multiple word 0.002232256
arabic letter 0.002188851
word length 0.002153021
bic word 0.002141123
arabic segmenter 0.002136159
arabic treebank 0.002131518
model training 0.002130606
arabic nlp 0.002128113
egyptian arabic 0.002125691
word seg 0.002125095
arabic orthography 0.002123856
word senses 0.002123253
simple feature 0.002110164
stanford word 0.002106261
word segmenters 0.002105396
arabic parsing 0.002095412
tal arabic 0.002083299
alectal arabic 0.002077518
arabic affixes 0.002074229
arabic dialects 0.002070881
informal arabic 0.002065886
levantine arabic 0.002057153
arabic orthog 0.002057153
arabic ortho 0.002057153
arabic segmenta 0.002057153
arabic construction 0.002057153
additional feature 0.002053959
feature space 0.00202128
segmentation model 0.001984103
original feature 0.001959215
additional features 0.001925969
new features 0.001902228
model output 0.001884232
feature templates 0.001875524
indicator feature 0.001875021
feature map 0.001868112
crf model 0.001842111
arabic 0.00183168
final model 0.001788747
single model 0.001781105
independent features 0.001767777
indicator features 0.001747031
ditional features 0.00174532
denero model 0.001744756
augmented model 0.001741099
feature 0.00164162
model 0.00151573
features 0.00151363
training data 0.001490388
morphological analyzer 0.001373306
foreign words 0.001369661
dialectal data 0.001312469
morphological richness 0.001293751
error analysis 0.001286023
set results 0.001281871
dialect data 0.00127837
data atb 0.001241059
set errors 0.001213406
gold data 0.00120701
character class 0.001206153
development data 0.001177544
annotated data 0.001177142
treebank data 0.00117535
test set 0.001162327
available data 0.001123298
words 0.00111955
other types 0.001102311
error categories 0.001090786
typographical error 0.001066933
current character 0.001045363
dialectal text 0.001037115
results table 0.001022669
correct analysis 0.001014831
other interactions 0.001001287
label space 9.78259E-4
natural language 9.74437E-4
development set 9.72265E-4
other ambiguities 9.65058E-4
machine translation 9.64947E-4
same split 9.58922E-4
unicode character 9.55674E-4
current text 9.5307E-4
language processing 9.32635E-4
original system 9.23413E-4
training sets 9.17391E-4
segmented sentence 9.15407E-4
velopment set 8.9707E-4
segmented training 8.90978E-4
formal text 8.81104E-4
surrounding sentence 8.66858E-4
