word pairs 0.001997142
other word 0.00198982
such data 0.001809531
many word 0.001672756
possible word 0.001645948
word association 0.001630001
test data 0.001509832
english word 0.0014873
word alignment 0.001460983
distinct word 0.001438252
word types 0.001436845
data problem 0.001432753
french word 0.001427464
bilingual word 0.001414261
word associations 0.00140442
gual word 0.001397035
other measure 0.0013692
only data 0.001367145
data analysis 0.001321466
other words 0.001318684
hansards data 0.001293946
present data 0.001293143
possible pairs 0.00129055
same probability 0.001285987
sparse data 0.001280679
exploratory data 0.001258289
enough data 0.001258289
hansard data 0.001258289
expected pairs 0.001152036
pair corpus 0.001141378
sentence pairs 0.001127867
new method 0.001118886
unseen pairs 0.001067649
information score 0.001059665
joint frequency 0.00105566
other optimizations 0.001049401
general method 0.001049089
such mea 0.00104852
same number 0.001047219
singleton pairs 0.00104327
sentence pair 0.001041004
nonindependent pairs 0.001040269
other equiv 0.001034288
association score 0.001027098
conditional probability 0.001025786
exact method 0.001020479
statistical measure 0.001017856
association measure 0.001009381
marginal probability 0.001007518
such sequences 9.97522E-4
plausible measures 9.93824E-4
first association 9.90006E-4
pair counts 9.87518E-4
exact probability 9.66581E-4
probability distributions 9.637E-4
negative log 9.51889E-4
same likelihood 9.43484E-4
hypergeometric probability 9.14806E-4
overall probability 9.12822E-4
probability distri 9.10332E-4
same sentence 9.05497E-4
mutual information 8.94953E-4
normal distribution 8.92548E-4
association noise 8.90668E-4
high noise 8.90535E-4
different sizes 8.82666E-4
same sequence 8.82488E-4
joint occurrences 8.80155E-4
llr score 8.64134E-4
joint frequencies 8.62398E-4
small number 8.56353E-4
negative association 8.42574E-4
joint count 8.39771E-4
joint fre 8.3641E-4
significant association 8.31134E-4
same combinations 8.28134E-4
bad measure 8.24986E-4
language processing 8.23599E-4
statistical methods 8.20988E-4
pairs 8.20872E-4
same conditions 8.20095E-4
marginal frequency 8.19806E-4
english words 8.16164E-4
statistical nlp 8.13279E-4
joint occurrence 8.09305E-4
first column 8.07876E-4
following values 8.06694E-4
linear regression 7.99232E-4
sociation score 7.95962E-4
score thresholds 7.9411E-4
large numbers 7.86932E-4
binomial distribution 7.84371E-4
log operation 7.8272E-4
ative log 7.8272E-4
plausible measure 7.79638E-4
ciation measure 7.79375E-4
joint frequen 7.78435E-4
statistical natural 7.76615E-4
large collec 7.74361E-4
measures 7.69836E-4
