rlvrmath-qwen2.5-1.5b/eval-results/mbpp/metrics.json


{
  "mbpp": {
    "pass@1": {
      "num_entries": 378,
      "avg_tokens": 67,
      "gen_seconds": 27,
      "passing_base_tests": 63.22751322751323,
      "passing_plus_tests": 54.4973544973545
    }
  }
}
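
The two pass rates above are easier to compare as problem counts. The following is a minimal sketch, assuming that `passing_base_tests` and `passing_plus_tests` are percentages of `num_entries`; the file itself does not state this. The helper name `passing_counts` is hypothetical and used only for illustration.

```python
import json

# Metrics copied verbatim from the file above.
METRICS_JSON = """
{
  "mbpp": {
    "pass@1": {
      "num_entries": 378,
      "avg_tokens": 67,
      "gen_seconds": 27,
      "passing_base_tests": 63.22751322751323,
      "passing_plus_tests": 54.4973544973545
    }
  }
}
"""


def passing_counts(metrics_text: str) -> dict:
    """Convert the pass@1 rates into whole-problem counts.

    Assumption: both rates are percentages of num_entries.
    """
    entry = json.loads(metrics_text)["mbpp"]["pass@1"]
    total = entry["num_entries"]
    return {
        "base": round(entry["passing_base_tests"] / 100 * total),
        "plus": round(entry["passing_plus_tests"] / 100 * total),
    }


counts = passing_counts(METRICS_JSON)
print(counts)  # {'base': 239, 'plus': 206}
```

Under that assumption, 239 of the 378 problems pass the base tests and 206 pass the plus tests.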