BastiAI-2-Instruct/llmtf_eval/darumeru_MultiQ_total.jsonl

{
    "task_name": "darumeru/MultiQ",
    "results": {
        "f1": 0.3346248767848689,
        "em": 0.22275334608030592
    },
    "leaderboard_result": 0.2786891114325874
}
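
The file does not say how `leaderboard_result` is derived. The numbers are consistent with an unweighted mean of the `f1` and `em` values in `results`. The sketch below checks that reading against the record above. The helper name `mean_of_results` is hypothetical, and the averaging rule is an assumption inferred from the values, not something the evaluation tool is confirmed to do.

```python
import json
import math

# Record copied verbatim from the file above.
record = json.loads("""
{
    "task_name": "darumeru/MultiQ",
    "results": {
        "f1": 0.3346248767848689,
        "em": 0.22275334608030592
    },
    "leaderboard_result": 0.2786891114325874
}
""")


def mean_of_results(rec):
    # Assumption (hypothetical helper): the leaderboard score is the
    # unweighted mean of every metric listed under "results".
    values = list(rec["results"].values())
    return sum(values) / len(values)


computed = mean_of_results(record)

# Compare with a tight relative tolerance rather than exact equality,
# since floating-point rounding can differ in the last digit.
matches = math.isclose(computed, record["leaderboard_result"], rel_tol=1e-12)
print(computed, matches)
```

If the check passes, it only shows that this one record agrees with the averaging assumption. Other tasks may combine their metrics differently.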