Files

9 lines
589 B
CSV
Raw Permalink Normal View History

task,avg_k,pass_k,avg_total_tokens,avg_thinking_tokens,max_thinking_tokens,min_thinking_tokens
gpqa_diamond,0.5774111675126904,0.766497461928934,10612.370558375635,0.0,0.0,0.0
hmmt2025,0.31666666666666665,0.43333333333333335,18237.566666666666,0.0,0.0,0.0
aime2024,0.6802083333333333,0.9,14426.277083333332,0.0,0.0,0.0
aime2025,0.571875,0.8666666666666667,15452.192708333334,0.0,0.0,0.0
math500,0.7485,0.768,4490.75,0.0,0.0,0.0
minerva,0.3290441176470588,0.38235294117647056,6507.237132352941,0.0,0.0,0.0
overall,0.6000676132521975,0.6657223796033994,9346.815584854632,0.0,0.0,0.0