llama_model_loader: loaded meta data with 39 key-value pairs and 363 tensors from granite-4.1-8b-Q5_K.gguf (version GGUF V3 (latest)) llama_model_loader: - type f32: 81 tensors llama_model_loader: - type q5_1: 1 tensors llama_model_loader: - type q4_K: 26 tensors llama_model_loader: - type q5_K: 246 tensors llama_model_loader: - type q6_K: 4 tensors llama_model_loader: - type iq4_xs: 5 tensors print_info: file format = GGUF V3 (latest) print_info: file type = Q5_K - Medium print_info: file size = 5.63 GiB (5.50 BPW) multiple_choice_score: there are 198 tasks in prompt multiple_choice_score: reading tasks......................................................................................................................................................................................................done multiple_choice_score : calculating GPQA-Diamond score over 198 tasks. Final result: 22.7273 +/- 2.9858 Random chance: 24.5963 +/- 3.0683