Compare commits

10 Commits

| Author | SHA1 | Message | Date |
|:-------|:-----|:--------|:-----|
| team mradermacher | 7ed1c0d302 | auto-patch README.md | 2025-02-16 14:02:19 +00:00 |
| team mradermacher | cfa0e7607c | uploaded from rich1 | 2025-02-16 14:01:04 +00:00 |
| team mradermacher | cad6e119f5 | uploaded from rich1 | 2025-02-16 13:59:42 +00:00 |
| team mradermacher | a7b058a4b1 | auto-patch README.md | 2025-02-16 13:58:02 +00:00 |
| team mradermacher | 151a4f2cc4 | uploaded from rich1 | 2025-02-16 13:56:12 +00:00 |
| team mradermacher | af95c98037 | uploaded from rich1 | 2025-02-16 13:54:46 +00:00 |
| team mradermacher | 4ba9fc7d4e | uploaded from rich1 | 2025-02-16 13:51:52 +00:00 |
| team mradermacher | dfc811814b | uploaded from rich1 | 2025-02-16 13:49:39 +00:00 |
| team mradermacher | d783bf5981 | uploaded from rich1 | 2025-02-16 13:49:04 +00:00 |
| team mradermacher | a441050cc7 | uploaded from rich1 | 2025-02-16 13:48:35 +00:00 |

10 changed files with 91 additions and 0 deletions

8
.gitattributes vendored
@@ -36,3 +36,11 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
MN-Sappho-d-12B.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:573367349cb60da7a752f11f6685457d7637eabf3c36a2a75962db5b20790a36
size 6800054624
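
Each three-line hunk in this comparison is a Git LFS pointer file: a spec version, a SHA-256 object ID, and the size in bytes of the real GGUF file stored out of band. As a minimal sketch (the names `LfsPointer`, `parse_lfs_pointer`, and `matches_pointer` are illustrative and not part of any Git LFS tooling), a pointer could be parsed and checked against a downloaded file like this:

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class LfsPointer:
    version: str    # spec URL from the "version" line
    hash_algo: str  # "sha256" in the pointers shown here
    oid: str        # hex digest of the real file
    size: int       # size of the real file in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return LfsPointer(fields["version"], algo, digest, int(fields["size"]))


def matches_pointer(path: str, pointer: LfsPointer, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    digest = hashlib.sha256()
    size = 0
    with open(path, "rb") as fh:
        while chunk := fh.read(chunk_size):
            digest.update(chunk)
            size += len(chunk)
    return size == pointer.size and digest.hexdigest() == pointer.oid


# The pointer text is copied from the hunk above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:573367349cb60da7a752f11f6685457d7637eabf3c36a2a75962db5b20790a36\n"
    "size 6800054624\n"
)
print(pointer.hash_algo, pointer.size, f"{pointer.size / 1e9:.2f} GB")
```

Reading the file in chunks keeps memory use flat, which matters for files of several gigabytes.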

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:161d9b1e136dbdaaba52b3a4b7dc8b6a29033cf198186e5ceb02e6b158475b8b
size 6561503584

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:53da41c7c734479195ffbb9c521d4bbbec1600c477efbe6320b09fc54f403bb0
size 6083090784

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fbca9079ca91616414f29cab2520d518e0f7dc22290edb8f3f7e01e4004f63db
size 5534226784

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d93c7520d010f9246b85c5fbc5b91286ebc3ceeb47c8c41fb45a5819b5301225
size 7477205344

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2a46f44a9519675d74f16996ea67e39f855f93e86104226c35e934302acf2a5a
size 8727632224

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a564a0787699b711760238e51e841aca75d920165f6a15aaeb1f80804c906223
size 8518736224

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:16d4e8cd1370b7d95b39f536e6a9f7bd65ed9e3ad892788751a0b848167bda30
size 10056210784

@@ -1,6 +1,65 @@
---
base_model: mergekit-community/MN-Sappho-d-12B
language:
- en
library_name: transformers
quantized_by: mradermacher
tags:
- mergekit
- merge
---
## About
<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: -->
static quants of https://huggingface.co/mergekit-community/MN-Sappho-d-12B
<!-- provided-files -->
Weighted/imatrix quants do not appear to be available from me at this time. If they do not show up within a week or so of the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion.
## Usage
If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including how to concatenate multi-part files.
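
As a rough illustration of what concatenating multi-part files involves, the sketch below appends a list of parts, in the order given, to one output file. The function name `concatenate_parts` and the part file names in the comment are hypothetical; nothing here says that any file in this repository is split, so check the actual file listing before using anything like this.

```python
import shutil
from pathlib import Path


def concatenate_parts(parts, output):
    """Append each part, in the order given, to a single output file.

    Returns the size of the resulting file in bytes.
    """
    with open(output, "wb") as dst:
        for part in parts:
            with open(part, "rb") as src:
                # Streams in chunks, so large parts are never held in memory.
                shutil.copyfileobj(src, dst)
    return Path(output).stat().st_size


# Hypothetical usage; these part names are placeholders, not real files:
# concatenate_parts(
#     ["model.gguf.part1of2", "model.gguf.part2of2"],
#     "model.gguf",
# )
```

The caller is responsible for passing the parts in the right order; the function does not sort or validate them.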
## Provided Quants
(sorted by size, not necessarily quality. IQ-quants are often preferable over similarly sized non-IQ quants)
| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q2_K.gguf) | Q2_K | 4.9 | |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q3_K_S.gguf) | Q3_K_S | 5.6 | |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q3_K_M.gguf) | Q3_K_M | 6.2 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q3_K_L.gguf) | Q3_K_L | 6.7 | |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.IQ4_XS.gguf) | IQ4_XS | 6.9 | |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q4_K_S.gguf) | Q4_K_S | 7.2 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q4_K_M.gguf) | Q4_K_M | 7.6 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q5_K_S.gguf) | Q5_K_S | 8.6 | |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q5_K_M.gguf) | Q5_K_M | 8.8 | |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q6_K.gguf) | Q6_K | 10.2 | very good quality |
| [GGUF](https://huggingface.co/mradermacher/MN-Sappho-d-12B-GGUF/resolve/main/MN-Sappho-d-12B.Q8_0.gguf) | Q8_0 | 13.1 | fast, best quality |
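
One way to use the table above is to pick the largest file that fits a given size budget. The following is only a sketch: the helper names `resolve_url` and `largest_quant_within` are made up for illustration, the sizes are copied from the Size/GB column, and the budget is compared against file size alone, which is an assumption (a runtime will typically need additional memory beyond the file itself). Since the table is sorted by size and not necessarily by quality, "largest that fits" is a heuristic, not a quality ranking.

```python
REPO = "mradermacher/MN-Sappho-d-12B-GGUF"

# (type, size in GB) pairs copied from the table above.
QUANTS = [
    ("Q2_K", 4.9),
    ("Q3_K_S", 5.6),
    ("Q3_K_M", 6.2),
    ("Q3_K_L", 6.7),
    ("IQ4_XS", 6.9),
    ("Q4_K_S", 7.2),
    ("Q4_K_M", 7.6),
    ("Q5_K_S", 8.6),
    ("Q5_K_M", 8.8),
    ("Q6_K", 10.2),
    ("Q8_0", 13.1),
]


def resolve_url(quant_type: str) -> str:
    """Build the download URL using the same pattern as the table links."""
    return (
        f"https://huggingface.co/{REPO}/resolve/main/"
        f"MN-Sappho-d-12B.{quant_type}.gguf"
    )


def largest_quant_within(budget_gb: float):
    """Return the type of the largest listed file not exceeding budget_gb, or None."""
    candidates = [(name, size) for name, size in QUANTS if size <= budget_gb]
    if not candidates:
        return None
    return max(candidates, key=lambda item: item[1])[0]


choice = largest_quant_within(8.0)
print(choice, resolve_url(choice))
```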
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):
![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
## FAQ / Model Request
See https://huggingface.co/mradermacher/model_requests for answers to
common questions, or to request that another model be quantized.
## Thanks
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time.
<!-- end -->