Initialize project; model provided by the ModelHub XC community
Model: Avtrkrb/granite-claude-h-350m-GGUF Source: Original Platform
41
.gitattributes
vendored
Normal file
@@ -0,0 +1,41 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
granite-claude-h-350m-F16.gguf filter=lfs diff=lfs merge=lfs -text
granite-claude-h-350m-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
granite-claude-h-350m-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
granite-claude-h-350m-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
granite-claude-h-350m-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
granite-claude-h-350m-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
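
One way to confirm which paths the rules above route through Git LFS is `git check-attr`. The sketch below is a minimal illustration, assuming `git` is installed. The scratch repository, the two-rule subset of `.gitattributes`, the helper name `lfs_filter`, and the file name `notes.txt` are hypothetical stand-ins, not part of this commit.

```bash
#!/bin/sh
# Hypothetical scratch repository; only the attribute rules matter here.
repo=$(mktemp -d)
cd "$repo" || exit 1
git init -q .

# A two-line subset of the .gitattributes added in this commit.
cat > .gitattributes <<'EOF'
*.zip filter=lfs diff=lfs merge=lfs -text
granite-claude-h-350m-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
EOF

# Print the value of the "filter" attribute for a path
# ("lfs" when a rule matches, "unspecified" otherwise).
lfs_filter() {
  git check-attr filter -- "$1" | sed 's/.*: filter: //'
}

lfs_filter granite-claude-h-350m-Q4_K_M.gguf
lfs_filter notes.txt
```

Note that the commit lists each `.gguf` file by name rather than using a `*.gguf` pattern, so a GGUF file with a different name would not be matched by these rules.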
88
README.md
Normal file
@@ -0,0 +1,88 @@
---
license: apache-2.0
language:
- en
pipeline_tag: text-generation
tags:
- granite
- gguf
- llama-cpp
- reasoning
- quantized
- local-llm

base_model: Avtrkrb/granite-claude-h-350m

library_name: gguf
---

# granite-claude-h-350m-GGUF

GGUF quantizations of:

`Avtrkrb/granite-claude-h-350m`

These files are intended for inference using:

- llama.cpp
- LM Studio
- Open WebUI
- Jan
- KoboldCpp
- GPT4All
- Ollama (after conversion/import)
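
For the Ollama route, a local GGUF file is typically imported through a Modelfile whose `FROM` line points at the file. The sketch below is an illustration under assumptions, not the author's documented procedure: the helper `write_modelfile` and the model name `granite-claude-h-350m` passed to `ollama create` are hypothetical, and the script only calls `ollama` when the CLI and the GGUF file are both present.

```bash
#!/bin/sh
# Write a minimal Modelfile that points Ollama at a local GGUF file.
# The Q4_K_M file name comes from this repository; the rest is illustrative.
write_modelfile() {
  printf 'FROM ./%s\n' "$1" > Modelfile
}

write_modelfile granite-claude-h-350m-Q4_K_M.gguf

# Register the model only when the ollama CLI and the GGUF file are available.
if command -v ollama >/dev/null 2>&1 && [ -f granite-claude-h-350m-Q4_K_M.gguf ]; then
  # The model name below is a hypothetical choice.
  ollama create granite-claude-h-350m -f Modelfile
fi
```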

---

## Available Quantizations

Typical variants included:

| Quant | Use Case |
|---------|---------|
| Q4_K_M | Best size / quality balance |
| Q5_K_M | Higher quality |
| Q6_K | Near-lossless for most use cases |
| Q8_0 | Highest quality quantized version |
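
The size side of this trade-off can be made concrete with the byte counts recorded in this repository's Git LFS pointers. The sketch below prints each quantized file's size as a percentage of the F16 file. The byte counts are taken from the pointers; the helper name `pct_of_f16` is hypothetical. File size is only a proxy and says nothing directly about output quality.

```bash
#!/bin/sh
# Size of granite-claude-h-350m-F16.gguf according to its LFS pointer.
F16_BYTES=839072288

# Print a file size as a percentage of the F16 size, to one decimal place.
pct_of_f16() {
  awk -v q="$1" -v f="$F16_BYTES" 'BEGIN { printf "%.1f\n", 100 * q / f }'
}

# name:bytes pairs copied from the LFS pointers in this commit.
for entry in Q4_0:259425600 Q4_K_M:266015040 Q5_K_M:305318208 Q6_K:347077824 Q8_0:448083264; do
  name=${entry%%:*}
  bytes=${entry##*:}
  printf '%-7s %s%%\n' "$name" "$(pct_of_f16 "$bytes")"
done
```

With these numbers, Q4_K_M comes out at roughly 32% of the F16 size and Q8_0 at roughly 53%.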

---

## Source Model

Merged model:

https://huggingface.co/Avtrkrb/granite-claude-h-350m

Dataset:

https://huggingface.co/datasets/Avtrkrb/combined-reasoning-claude

---

## Example llama.cpp Usage

```bash
./llama-cli \
  -m granite-claude-h-350m-Q4_K_M.gguf \
  -p "Explain quantum tunneling."
```

---

## Recommended Quant

For most users:

**Q4_K_M**

offers the best balance between:

- quality
- speed
- memory usage

---

## License

This repository follows the licensing terms of the original Granite model.
3
granite-claude-h-350m-F16.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bd9e51ad0dbfea07a979e6bb33d6ab42ed19a362a03a3e12da1ef7ffa7a96b79
size 839072288
3
granite-claude-h-350m-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0b0260502a3f5d0c53740592e974c736e82147f1aa6dc6f50f7f77e1ff6b8220
size 259425600
3
granite-claude-h-350m-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:53f3855ea5bcea3583a66f1369a27d261b58ed0b64850a44a56a2981eb39b3c2
size 266015040
3
granite-claude-h-350m-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d7926fd19c8ee4317483946de215c9e6437c9060624842e3deeef7d48274fef8
size 305318208
3
granite-claude-h-350m-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:92d1596c0047ce3681ae80e1993429261177eb4ddb8e6b4bf3e850498f173370
size 347077824
3
granite-claude-h-350m-Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6b33768b9eb9c0a34b2dcdc02d8afc391cd8d65c6b3cdeca6ba080798d4a4d4a
size 448083264
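
Each pointer above records the SHA-256 digest (`oid`) and byte size of the real file, so a downloaded GGUF can be checked against it. The following is a minimal sketch, assuming `sha256sum` (GNU coreutils) and `wc` are available; the function name `verify_gguf` is hypothetical.

```bash
#!/bin/sh
# Compare a local file against the oid (SHA-256) and size from its LFS pointer.
# Usage: verify_gguf FILE EXPECTED_SHA256 EXPECTED_SIZE
# Returns 0 when both the digest and the byte count match, non-zero otherwise.
verify_gguf() {
  actual_sha=$(sha256sum "$1" | cut -d ' ' -f 1)
  actual_size=$(wc -c < "$1" | tr -d ' ')
  [ "$actual_sha" = "$2" ] && [ "$actual_size" = "$3" ]
}

# Values copied from the Q8_0 pointer in this commit; runs only if the file exists.
if [ -f granite-claude-h-350m-Q8_0.gguf ]; then
  verify_gguf granite-claude-h-350m-Q8_0.gguf \
    6b33768b9eb9c0a34b2dcdc02d8afc391cd8d65c6b3cdeca6ba080798d4a4d4a \
    448083264 && echo "Q8_0 matches its pointer"
fi
```

A mismatch in either value usually means an incomplete download or that the LFS pointer file itself was fetched instead of the model.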