38 lines
1.0 KiB
Plaintext
38 lines
1.0 KiB
Plaintext
====== Perplexity statistics ======
|
|
Mean PPL(Q) : 19.313300 ± 0.150799
|
|
Mean PPL(base) : 6.180577 ± 0.041038
|
|
Cor(ln(PPL(Q)), ln(PPL(base))): 74.61%
|
|
Mean ln(PPL(Q)/PPL(base)) : 1.139382 ± 0.005262
|
|
Mean PPL(Q)/PPL(base) : 3.124838 ± 0.016443
|
|
Mean PPL(Q)-PPL(base) : 13.132723 ± 0.123247
|
|
|
|
====== KL divergence statistics ======
|
|
Mean KLD: 1.248712 ± 0.004216
|
|
Maximum KLD: 28.765745
|
|
99.9% KLD: 13.682988
|
|
99.0% KLD: 8.611128
|
|
99.0% KLD: 8.611128
|
|
Median KLD: 0.769048
|
|
10.0% KLD: 0.075574
|
|
5.0% KLD: 0.027425
|
|
1.0% KLD: 0.004777
|
|
Minimum KLD: 0.000121
|
|
|
|
====== Token probability statistics ======
|
|
Mean Δp: -17.672 ± 0.084 %
|
|
Maximum Δp: 94.089%
|
|
99.9% Δp: 73.751%
|
|
99.0% Δp: 50.009%
|
|
95.0% Δp: 22.814%
|
|
90.0% Δp: 9.454%
|
|
75.0% Δp: 0.004%
|
|
Median Δp: -4.889%
|
|
25.0% Δp: -32.188%
|
|
10.0% Δp: -72.799%
|
|
5.0% Δp: -91.232%
|
|
1.0% Δp: -99.761%
|
|
0.1% Δp: -99.966%
|
|
Minimum Δp: -99.998%
|
|
RMS Δp : 37.260 ± 0.088 %
|
|
Same top p: 62.749 ± 0.124 %
|