Files

11 lines
186 B
JSON
Raw Permalink Normal View History

{
"hendrycks_math": {
"pass@1": {
"num_entries": 5000,
"avg_tokens": 672,
"gen_seconds": 142,
"symbolic_correct": 53.22,
"no_answer": 7.62
}
}
}