other sentence 0.0037363450000000003
same sentence 0.003647589
sentence number 0.00361314
sentence length 0.0035432750000000002
first sentence 0.003513009
similar sentence 0.003466601
large sentence 0.003430035
sentence statistics 0.00334294
sentence num 0.00331494
sentence numbers 0.0033117610000000003
preceding sentence 0.003311285
tenth sentence 0.0033059950000000004
sentence 0.00304854
sentences paragraph 0.002456924
first sentences 0.002223059
factor sentences 0.002150905
related sentences 0.002144071
second sentences 0.002139657
individual sentences 0.002082277
few sentences 0.002073871
ous sentences 0.002065994
group sentences 0.002063259
third sentences 0.002045725
later sentences 0.00204415
preceding sentences 0.002021335
gether sentences 0.002016551
sentences 0.00175859
word entropy 0.0014536000000000002
language model 0.0014497680000000002
training data 0.001354948
data set 0.001316036
journal text 0.001309736
entropy rate 0.001236506
news writing 0.001235802
information term 0.001200813
new paragraph 0.0011727180000000001
entropy results 0.001167987
same method 0.0011678840000000001
trigram model 0.0011606
testing data 0.001134026
average number 0.001132988
news article 0.00112509
mutual information 0.001118544
different testing 0.001111281
different genres 0.001096542
data point 0.001091904
same set 0.0010908200000000002
different ways 0.001089122
heterogeneous data 0.001083388
local context 0.001081752
human processing 0.001079466
entropy variation 0.001071866
different lan 0.001069015
conditional entropy 0.001059793
global context 0.001057685
news feed 0.001048952
news script 0.001045439
news reportage 0.001045439
paragraph boundaries 0.001042358
entropy increase 0.0010277160000000001
training set 0.001022454
previous work 0.001005802
paragraph break 0.00100066
entropy increases 0.001000076
natural language 9.954690000000001E-4
ith word 9.89735E-4
paragraph boundary 9.81623E-4
several words 9.79982E-4
ellipsis nodes 9.75626E-4
paragraph breaks 9.73445E-4
leaf nodes 9.6983E-4
course context 9.641159999999999E-4
other languages 9.61026E-4
small number 9.598829999999999E-4
human communications 9.54887E-4
few words 9.51983E-4
multiple topic 9.465820000000001E-4
language processing 9.390590000000001E-4
graph boundaries 9.353110000000001E-4
single topic 9.101930000000001E-4
average value 9.01818E-4
same scale 9.000410000000001E-4
model 8.9693E-4
national corpus 8.946570000000001E-4
same amount 8.941070000000001E-4
treebank corpus 8.914120000000001E-4
related work 8.854760000000001E-4
topic shift 8.83979E-4
topic shifts 8.81335E-4
natural way 8.79305E-4
graph breaks 8.66398E-4
information 8.41951E-4
first time 8.369499999999999E-4
language modeling 8.36592E-4
number increases 8.364199999999999E-4
nal average 8.305159999999999E-4
practical problem 8.29581E-4
future work 8.28278E-4
bucket number 8.277019999999999E-4
first term 8.233310000000001E-4
