word sequence 0.003267258
sms word 0.0032636749999999997
input word 0.002933186
word error 0.002851073
word lattice 0.002737765
probable word 0.002714323
word boundaries 0.00269842
word formations 0.00268084
language model 0.0023916479999999997
phonetic model 0.0022797729999999997
lexical model 0.002233085
translation model 0.002226891
different normalization 0.00209256
sms normalization 0.002072335
normalization sms 0.002072335
sms words 0.0018980949999999998
split model 0.001801556
this model 0.0017976379999999998
model riv 0.001795924
model roov 0.001778403
normalization algorithm 0.0017691270000000001
possible normalization 0.001742193
normalization models 0.001736659
normalization process 0.001716401
normalization step 0.001653108
sms language 0.0016454529999999998
normalization task 0.001643844
normalization framework 0.00162707
sms sequence 0.0016171929999999998
second normalization 0.0015914319999999998
true normalization 0.001583118
normalization part 0.001570279
model 0.001553
next normalization 0.001546424
normalization module 0.001531827
output words 0.001526368
last normalization 0.0015249299999999999
normalization steps 0.0015238959999999998
probable normalization 0.0015229829999999999
lexical language 0.001518733
sms text 0.001483459
sms corpus 0.0014747839999999998
oov words 0.00139504
other sequence 0.001391237
other sms 0.0013876539999999999
synthesis system 0.001364348
vocabulary words 0.001363683
sms sentences 0.001345752
noisy sequence 0.0013244189999999999
agglutinated words 0.001315507
language models 0.001309777
input sequence 0.001286704
normalization 0.00126553
asr system 0.001248493
standard nlp 0.001231426
noisy data 0.001226509
sms message 0.0012222589999999998
sms form 0.001189883
message standard 0.0011876690000000001
same corpus 0.001170633
sms sequences 0.0011701559999999999
language processing 0.001161294
lexical forms 0.001154173
input text 0.00115297
source language 0.001143352
other systems 0.0011359590000000002
standard transcription 0.001127824
machine translation 0.001120004
natural language 0.00111897
target language 0.0011182100000000001
sms modules 0.001117602
modules sms 0.001117602
oov sequence 0.001114138
initial sequence 0.001109485
frequent sms 0.001093787
text message 0.001092108
words 0.00109129
phonetic sequences 0.001090124
french sms 0.001087312
print sms 0.001073787
first character 0.001073552
different ways 0.001066806
oral language 0.001062677
sms messages 0.0010604079999999999
translation task 0.001052205
other models 0.001051978
graphemic sequence 0.001047013
corpus alignment 0.0010414439999999999
preprocessing sms 0.0010356649999999998
sms preprocessing 0.0010356649999999998
translation framework 0.001035431
sequence aussi 0.001034648
sms postprocessing 0.001033473
system 0.00101589
training corpus 0.001013663
standard deviation 0.001008168
noisy tokens 0.00100501
postprocessing standard 9.98883E-4
phonetic abbreviations 9.985929999999999E-4
standard notion 9.96641E-4
