38 lines
1.0 KiB
Plaintext
38 lines
1.0 KiB
Plaintext
====== Perplexity statistics ======
|
|
Mean PPL(Q) : 18.174846 ± 0.142320
|
|
Mean PPL(base) : 6.180577 ± 0.041038
|
|
Cor(ln(PPL(Q)), ln(PPL(base))): 75.14%
|
|
Mean ln(PPL(Q)/PPL(base)) : 1.078627 ± 0.005222
|
|
Mean PPL(Q)/PPL(base) : 2.940639 ± 0.015356
|
|
Mean PPL(Q)-PPL(base) : 11.994269 ± 0.114726
|
|
|
|
====== KL divergence statistics ======
|
|
Mean KLD: 1.159685 ± 0.004238
|
|
Maximum KLD: 28.100733
|
|
99.9% KLD: 14.541190
|
|
99.0% KLD: 8.790474
|
|
99.0% KLD: 8.790474
|
|
Median KLD: 0.682733
|
|
10.0% KLD: 0.066346
|
|
5.0% KLD: 0.024022
|
|
1.0% KLD: 0.004240
|
|
Minimum KLD: 0.000159
|
|
|
|
====== Token probability statistics ======
|
|
Mean Δp: -16.552 ± 0.083 %
|
|
Maximum Δp: 94.307%
|
|
99.9% Δp: 72.111%
|
|
99.0% Δp: 47.813%
|
|
95.0% Δp: 23.079%
|
|
90.0% Δp: 10.497%
|
|
75.0% Δp: 0.030%
|
|
Median Δp: -4.029%
|
|
25.0% Δp: -29.474%
|
|
10.0% Δp: -70.364%
|
|
5.0% Δp: -90.072%
|
|
1.0% Δp: -99.761%
|
|
0.1% Δp: -99.966%
|
|
Minimum Δp: -99.998%
|
|
RMS Δp : 36.214 ± 0.088 %
|
|
Same top p: 64.455 ± 0.123 %
|