word model 0.00573808
word error 0.005020538
correction word 0.005017937
word probability 0.004873137
word segmentation 0.004802338
word spelling 0.004721095
unknown word 0.004651954
word length 0.004634049
word distance 0.004621726
word sequence 0.004614771
statistical word 0.004543428
candidate word 0.004524026
word boundary 0.00449079
word confusion 0.004465164
character words 0.00445941
word bigram 0.004430937
hidden word 0.004430082
word match 0.004398932
average word 0.004390161
approximate word 0.004386949
distinct word 0.004381147
correct word 0.004371682
imate word 0.004362314
likely word 0.004362067
short word 0.004361127
word matching 0.00435943
word prob 0.004348681
word hypotheses 0.004348443
sonable word 0.004346159
word boundaries 0.004345722
word bigrams 0.004340347
word segmentat 0.004337917
dependent word 0.004329317
word perplexity 0.004328847
word symbol 0.00432462
partial word 0.004324075
mate word 0.004319487
word delimiters 0.004318574
isolated word 0.004318574
character ocr 0.00389697
large character 0.003198583
japanese character 0.003197958
character sequence 0.003167321
input character 0.003149539
character recognition 0.003109388
output character 0.00310777
character set 0.003085363
same character 0.003070715
character confusion 0.003017714
character similarity 0.002984824
ocr model 0.00298306
character clustering 0.002975179
character classes 0.002969022
character uni 0.002956786
correct character 0.002924232
character sets 0.002923335
character shape 0.002897633
character bigrams 0.002892897
character ocrs 0.002875606
character pairs 0.002874649
character matrices 0.002874196
character ecognition 0.002871749
character confu 0.002871749
character distri 0.002871749
ted character 0.002871749
language model 0.00273133
character 0.00260227
unknown words 0.002459374
dictionary words 0.002454285
segmentation model 0.002440978
distance words 0.002429146
model first 0.002381869
ocr errors 0.002275357
ocr error 0.002265518
ion model 0.002216317
distinct words 0.002188567
foreign words 0.002149822
frequent words 0.002145184
known words 0.002136449
bigram model 0.002069577
channel model 0.002067982
ocr training 0.002030385
anguage model 0.002003835
model let 0.002002003
ocr spelling 0.001966075
error correction 0.001939035
japanese ocr 0.001890388
words 0.00185714
ocr output 0.0018002
statistical ocr 0.001788408
correction method 0.001733985
baseline ocr 0.001708646
model 0.00168836
novel ocr 0.001671999
spelling correction 0.001639592
ocr mode 0.001621542
acter ocr 0.001610674
language models 0.001605073
ocr accuracies 0.001594239
ocr score 0.001592371
