Files
rlvrmathif-qwen2.5-1.5b/eval-results/human-eval/metrics.json

11 lines
217 B
JSON
Raw Normal View History

{
"human-eval": {
"pass@1": {
"num_entries": 164,
"avg_tokens": 109,
"gen_seconds": 53,
"passing_base_tests": 48.78048780487805,
"passing_plus_tests": 42.073170731707314
}
}
}