language model 0.0030822
data model 0.00302162
language models 0.002381942
large data 0.002316163
work language 0.00220995
test data 0.002162006
language modeling 0.002110771
simple language 0.002097576
statistical language 0.002068513
basic language 0.002017104
language mod 0.001961235
data set 0.001954556
naive language 0.001910185
building language 0.001904556
entire data 0.001899803
data sets 0.001890586
trial data 0.001865097
data transfer 0.001859254
data sparsity 0.00185519
plicate data 0.001843113
cate data 0.001843113
model precision 0.001818281
memory space 0.001664166
word sequences 0.001636029
language 0.00163518
new word 0.001633944
memory cost 0.0016086
target word 0.001557352
other words 0.001509416
model 0.00144702
memory requirements 0.001438917
available memory 0.001429605
final memory 0.001425522
memory pointers 0.001388075
other tool 0.001313329
web corpus 0.001293083
key value 0.001288364
other nlp 0.001251563
gram corpus 0.001251282
other hand 0.001246016
entire corpus 0.001233854
large number 0.001232093
smoothing method 0.001223523
smoothing algorithm 0.001206864
order probability 0.001204587
brown corpus 0.001185504
statistical models 0.001180095
other tools 0.001176635
trigram models 0.001170329
key values 0.001166841
single file 0.00116434
node order 0.001161878
large corpora 0.001141509
search algorithm 0.001122444
first key 0.001121302
efficient method 0.001101921
index size 0.00109806
lookup value 0.001092951
large bucket 0.00108887
trigram file 0.001080889
integer value 0.001074864
large files 0.001072516
memory 0.00106058
bucket size 0.001057888
machine translation 0.001051264
disk space 0.001044379
average size 0.001040201
simple approach 0.001036613
large datasets 0.001027828
different senses 0.001025225
probability distribution 0.001021924
order parameter 0.001021052
simple implementation 0.001013471
total count 0.001005537
such queries 0.001003824
access time 0.000992061
disk cost 0.000988813
order probabilities 0.00098079
search tree 0.000980188
optimal storage 0.000975623
smoothing algorithms 0.000969089
factor values 0.000950967
single disk 0.000947811
missing counts 0.000940134
ated counts 0.000935255
rect file 0.000928447
binary search 0.000914052
target words 0.000913531
indexing time 0.000912868
tree structure 0.000910401
wildcard queries 0.000909489
corpus 0.000908651
first integer 0.000907802
test dataset 0.000907255
order parameters 0.000902153
indexing cost 0.000897864
online performance 0.000893487
last read 0.00089042
lexicographic order 0.000887346
long time 0.000885314
