初始化项目，由ModelHub XC社区提供模型

Model: ibm-granite/granite-4.1-30b-GGUF Source: Original Platform
2026-06-17 16:38:16 +08:00
commit 113fb50b55
21 changed files with 142 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,31 @@
+---
+pipeline_tag: text-generation
+inference: false
+license: apache-2.0
+library_name: transformers
+tags:
+- language
+- granite-4.1
+- gguf
+base_model:
+- ibm-granite/granite-4.1-30b
+---
+
+> [!NOTE]
+> This repository contains models that have been converted to the GGUF format with various quantizations from an IBM Granite `.safetensors` model.
+>
+> Please reference the base model's full model card here:
+> https://huggingface.co/ibm-granite/granite-4.1-30b
+
+### Merging the `.bf16` model
+
+The `bf16` model had to be split into multiple files to accommodate single file size restrictions 
+using the `llama-gguf-split` tool, with its default `--split` settings, which can be built from the [ggml-org/llama.cpp](https://github.com/ggml-org/llama.cpp) project.
+
+Use the following command to merge the split files which points to the first file in the sequence:
+
+```bash
+llama-gguf-split --merge granite-4.1-30b-bf16-00001-of-00005.gguf granite-4.1-30b-bf16.gguf
+```
+
+The remaining split filenames are inferred by the tool based upon the `00001-of-0000x` naming convention.