Files
rlvrmulti-qwen2.5-1.5b/eval-results/mbpp/metrics.json

11 lines
209 B
JSON
Raw Normal View History

{
"mbpp": {
"pass@1": {
"num_entries": 378,
"avg_tokens": 65,
"gen_seconds": 21,
"passing_base_tests": 66.4021164021164,
"passing_plus_tests": 57.142857142857146
}
}
}