Files
rlvrmulti-qwen2.5-1.5b/eval-results/gsm8k/metrics.json

11 lines
201 B
JSON
Raw Normal View History

{
"gsm8k": {
"pass@1": {
"num_entries": 1319,
"avg_tokens": 341,
"gen_seconds": 46,
"symbolic_correct": 78.01364670204701,
"no_answer": 1.288855193328279
}
}
}