llama_model_loader: loaded meta data with 39 key-value pairs and 363 tensors from granite-4.1-8b-Q7_K.gguf (version GGUF V3 (latest))
llama_model_loader: - type f32: 81 tensors
llama_model_loader: - type q5_1: 4 tensors
llama_model_loader: - type q8_0: 146 tensors
llama_model_loader: - type q6_K: 132 tensors
print_info: file format = GGUF V3 (latest)
print_info: file type = Q8_0
print_info: file size = 7.68 GiB (7.50 BPW)
multiple_choice_score: there are 198 tasks in prompt
multiple_choice_score: reading tasks......................................................................................................................................................................................................done
multiple_choice_score : calculating GPQA-Diamond score over 198 tasks.
Final result: 26.7677 +/- 3.1544
Random chance: 24.5963 +/- 3.0683