first word 0.003371173
word boundaries 0.0030637549999999996
sanskrit word 0.0029989699999999997
word boundary 0.002981362
valid word 0.002952602
proper word 0.002949757
right word 0.002933285
word seg 0.002926902
different words 0.00289517
word vacah 0.002894312
other words 0.002539122
sanskrit words 0.002213
distinct words 0.002130644
words xaxi 0.002110954
meaningful words 0.0021089480000000002
insignificant words 0.0021089480000000002
different rule 0.001933773
words 0.0018404
sandhi rules 0.0017427
euphonic rules 0.001628848
other candidate 0.001619302
first character 0.001520269
different set 0.001517611
same corpus 0.001494543
different approaches 0.001472699
different ways 0.001424552
different constituents 0.001416198
morphological analyzers 0.001385159
sandhi rule 0.001371893
morphological analyzer 0.0013597599999999998
morphological analyser 0.0013187919999999998
input string 0.001305785
first approach 0.001304061
optimal candidate 0.0012958140000000002
rule sequence 0.001263155
parallel corpus 0.001251928
rules 0.00124981
valid candidate 0.001246812
winning candidate 0.0012002380000000002
test data 0.001191768
sanskrit string 0.0011905800000000001
above string 0.001159108
such cases 0.001157954
ith rule 0.001149713
cable rule 0.001148405
baseline system 0.001125909
first letter 0.0011144220000000001
continuous string 0.001112203
unicode string 0.0010882700000000001
other candidates 0.001084244
last character 0.0010784990000000001
frequency distribution 0.001071104
several characters 0.00106733
nlp system 0.0010649330000000001
segmentation process 0.00105224
first answer 0.001046508
translation system 0.0010387600000000001
final state 0.001034533
same test 0.001026053
noun morphology 0.001022521
following results 0.001021709
other languages 0.001019839
line system 0.001019051
possible candidates 0.0010180760000000001
first appli 0.001015028
second approach 0.001013278
automatic segmentation 0.001011939
sandhied form 9.98962E-4
split form 9.95895E-4
next state 9.70286E-4
other alternatives 9.684400000000001E-4
other possibilities 9.684400000000001E-4
main problem 9.67975E-4
dalone system 9.67392E-4
surface form 9.57876E-4
corpus 9.53896E-4
written form 9.532379999999999E-4
unsandhied form 9.51609E-4
time application 9.350270000000001E-4
possible seg 9.330860000000001E-4
syllable segmentation 9.2731E-4
sanskrit text 9.241659999999999E-4
novel method 9.20903E-4
candidate 9.2058E-4
finite state 9.180639999999999E-4
possible segmentations 9.15253E-4
previous approach 9.144839999999999E-4
possible outputs 9.11707E-4
possible transitions 9.10629E-4
current state 9.094179999999999E-4
initial segment 9.041259999999999E-4
correct output 9.03732E-4
state transducer 9.00996E-4
output rank 8.97009E-4
original part 8.90429E-4
start state 8.76283E-4
nlp systems 8.73985E-4
sandhied text 8.712419999999999E-4
nite state 8.69552E-4
split text 8.681749999999999E-4
