rlvrmathif-qwen2.5-1.5b/eval-results/mbpp/metrics.json

{
  "mbpp": {
    "pass@1": {
      "num_entries": 378,
      "avg_tokens": 66,
      "gen_seconds": 53,
      "passing_base_tests": 61.111111111111114,
      "passing_plus_tests": 52.645502645502646
    }
  }
}
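
The pass rates above are easier to sanity-check as whole-problem counts. The sketch below is not part of the evaluation output. It assumes that `passing_base_tests` and `passing_plus_tests` are percentages of `num_entries`, which the file itself does not state, and the helper name `passing_counts` is hypothetical.

```python
import json

# Metrics copied verbatim from the file above.
METRICS_JSON = """
{
  "mbpp": {
    "pass@1": {
      "num_entries": 378,
      "avg_tokens": 66,
      "gen_seconds": 53,
      "passing_base_tests": 61.111111111111114,
      "passing_plus_tests": 52.645502645502646
    }
  }
}
"""


def passing_counts(metrics):
    """Convert the two pass rates into problem counts.

    Assumption: both rates are percentages of `num_entries`.
    """
    entry = metrics["mbpp"]["pass@1"]
    total = entry["num_entries"]
    return {
        key: round(total * entry[key] / 100)
        for key in ("passing_base_tests", "passing_plus_tests")
    }


counts = passing_counts(json.loads(METRICS_JSON))
print(counts)
```

Under that assumption, the two rates correspond to 231 and 199 of the 378 entries, respectively.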