38 lines
1.0 KiB
Plaintext
38 lines
1.0 KiB
Plaintext
====== Perplexity statistics ======
|
|
Mean PPL(Q) : 19.765437 ± 0.153182
|
|
Mean PPL(base) : 6.180577 ± 0.041038
|
|
Cor(ln(PPL(Q)), ln(PPL(base))): 74.13%
|
|
Mean ln(PPL(Q)/PPL(base)) : 1.162523 ± 0.005278
|
|
Mean PPL(Q)/PPL(base) : 3.197992 ± 0.016878
|
|
Mean PPL(Q)-PPL(base) : 13.584860 ± 0.125811
|
|
|
|
====== KL divergence statistics ======
|
|
Mean KLD: 1.295119 ± 0.004177
|
|
Maximum KLD: 27.306818
|
|
99.9% KLD: 13.228414
|
|
99.0% KLD: 8.401047
|
|
99.0% KLD: 8.401047
|
|
Median KLD: 0.824191
|
|
10.0% KLD: 0.081751
|
|
5.0% KLD: 0.029987
|
|
1.0% KLD: 0.004980
|
|
Minimum KLD: 0.000157
|
|
|
|
====== Token probability statistics ======
|
|
Mean Δp: -18.417 ± 0.085 %
|
|
Maximum Δp: 92.683%
|
|
99.9% Δp: 74.856%
|
|
99.0% Δp: 50.844%
|
|
95.0% Δp: 22.899%
|
|
90.0% Δp: 9.183%
|
|
75.0% Δp: 0.001%
|
|
Median Δp: -5.493%
|
|
25.0% Δp: -34.119%
|
|
10.0% Δp: -74.211%
|
|
5.0% Δp: -91.714%
|
|
1.0% Δp: -99.759%
|
|
0.1% Δp: -99.964%
|
|
Minimum Δp: -99.995%
|
|
RMS Δp : 38.004 ± 0.087 %
|
|
Same top p: 61.136 ± 0.125 %
|