38 lines
1.0 KiB
Plaintext
38 lines
1.0 KiB
Plaintext
====== Perplexity statistics ======
|
|
Mean PPL(Q) : 18.723777 ± 0.145380
|
|
Mean PPL(base) : 6.180577 ± 0.041038
|
|
Cor(ln(PPL(Q)), ln(PPL(base))): 75.90%
|
|
Mean ln(PPL(Q)/PPL(base)) : 1.108382 ± 0.005110
|
|
Mean PPL(Q)/PPL(base) : 3.029454 ± 0.015481
|
|
Mean PPL(Q)-PPL(base) : 12.543199 ± 0.117315
|
|
|
|
====== KL divergence statistics ======
|
|
Mean KLD: 1.226150 ± 0.004006
|
|
Maximum KLD: 27.303829
|
|
99.9% KLD: 13.319038
|
|
99.0% KLD: 8.045850
|
|
99.0% KLD: 8.045850
|
|
Median KLD: 0.778573
|
|
10.0% KLD: 0.072866
|
|
5.0% KLD: 0.026539
|
|
1.0% KLD: 0.004645
|
|
Minimum KLD: 0.000149
|
|
|
|
====== Token probability statistics ======
|
|
Mean Δp: -17.217 ± 0.084 %
|
|
Maximum Δp: 93.510%
|
|
99.9% Δp: 74.449%
|
|
99.0% Δp: 50.740%
|
|
95.0% Δp: 23.161%
|
|
90.0% Δp: 9.707%
|
|
75.0% Δp: 0.007%
|
|
Median Δp: -4.675%
|
|
25.0% Δp: -31.413%
|
|
10.0% Δp: -71.201%
|
|
5.0% Δp: -90.111%
|
|
1.0% Δp: -99.699%
|
|
0.1% Δp: -99.959%
|
|
Minimum Δp: -99.998%
|
|
RMS Δp : 36.807 ± 0.087 %
|
|
Same top p: 63.002 ± 0.124 %
|