Initialize project; model provided by the ModelHub XC community
Model: mradermacher/boto-27B-i1-GGUF Source: Original Platform
57
.gitattributes
vendored
Normal file
@@ -0,0 +1,57 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
imatrix.dat filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
boto-27B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
82
README.md
Normal file
@@ -0,0 +1,82 @@
---
base_model: lucianosb/boto-27B
datasets:
- lucianosb/cetacean-ptbr
language:
- pt
library_name: transformers
license: apache-2.0
quantized_by: mradermacher
tags:
- text-generation-inference
- transformers
- unsloth
- gemma2
- trl
- sft
---
## About

<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
weighted/imatrix quants of https://huggingface.co/lucianosb/boto-27B

<!-- provided-files -->
static quants are available at https://huggingface.co/mradermacher/boto-27B-GGUF
## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including on how to concatenate multi-part files.
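
As a rough illustration of the concatenation step, here is a minimal Python sketch. It assumes the parts are plain byte-wise splits of one file that only need to be joined in order; the part names in the usage note are hypothetical, and none of the files listed in this repository are split. The helper `concat_parts` is not part of any library.

```python
import shutil
from pathlib import Path


def concat_parts(parts, out_path, chunk_size=1 << 20):
    """Append each part, in the order given, to out_path.

    Assumes the parts are raw byte splits of a single file.
    Returns the size of the joined file in bytes.
    """
    out_path = Path(out_path)
    with out_path.open("wb") as out:
        for part in parts:
            with Path(part).open("rb") as src:
                # Stream in chunks so multi-gigabyte parts never sit in memory.
                shutil.copyfileobj(src, out, chunk_size)
    return out_path.stat().st_size
```

A call might look like `concat_parts(["model.gguf.part1of2", "model.gguf.part2of2"], "model.gguf")`, with the parts passed in their numbered order.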

## Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ1_S.gguf) | i1-IQ1_S | 6.2 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ1_M.gguf) | i1-IQ1_M | 6.8 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 7.7 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 8.5 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ2_S.gguf) | i1-IQ2_S | 8.8 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ2_M.gguf) | i1-IQ2_M | 9.5 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q2_K.gguf) | i1-Q2_K | 10.5 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 10.9 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ3_XS.gguf) | i1-IQ3_XS | 11.7 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ3_S.gguf) | i1-IQ3_S | 12.3 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q3_K_S.gguf) | i1-Q3_K_S | 12.3 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ3_M.gguf) | i1-IQ3_M | 12.6 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q3_K_M.gguf) | i1-Q3_K_M | 13.5 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q3_K_L.gguf) | i1-Q3_K_L | 14.6 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-IQ4_XS.gguf) | i1-IQ4_XS | 14.9 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q4_0.gguf) | i1-Q4_0 | 15.8 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q4_K_S.gguf) | i1-Q4_K_S | 15.8 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q4_K_M.gguf) | i1-Q4_K_M | 16.7 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q5_K_S.gguf) | i1-Q5_K_S | 19.0 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q5_K_M.gguf) | i1-Q5_K_M | 19.5 |  |
| [GGUF](https://huggingface.co/mradermacher/boto-27B-i1-GGUF/resolve/main/boto-27B.i1-Q6_K.gguf) | i1-Q6_K | 22.4 | practically like static Q6_K |
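
One way to narrow down the table is to filter by how much memory is available. The sketch below copies the Type and Size/GB columns from the table and returns the largest file that fits a budget. The helper name `largest_fitting` and the 2 GB `overhead_gb` default are assumptions made for illustration, not measured values, and the selection is by size only, so the Notes column still applies when two quants are close in size.

```python
# (type, size in GB) pairs copied from the table above.
QUANTS = [
    ("i1-IQ1_S", 6.2), ("i1-IQ1_M", 6.8), ("i1-IQ2_XXS", 7.7),
    ("i1-IQ2_XS", 8.5), ("i1-IQ2_S", 8.8), ("i1-IQ2_M", 9.5),
    ("i1-Q2_K", 10.5), ("i1-IQ3_XXS", 10.9), ("i1-IQ3_XS", 11.7),
    ("i1-IQ3_S", 12.3), ("i1-Q3_K_S", 12.3), ("i1-IQ3_M", 12.6),
    ("i1-Q3_K_M", 13.5), ("i1-Q3_K_L", 14.6), ("i1-IQ4_XS", 14.9),
    ("i1-Q4_0", 15.8), ("i1-Q4_K_S", 15.8), ("i1-Q4_K_M", 16.7),
    ("i1-Q5_K_S", 19.0), ("i1-Q5_K_M", 19.5), ("i1-Q6_K", 22.4),
]


def largest_fitting(quants, budget_gb, overhead_gb=2.0):
    """Return the largest (type, size) pair that fits, or None.

    overhead_gb is an assumed allowance for context and runtime
    buffers; adjust it for your own setup.
    """
    fitting = [q for q in quants if q[1] + overhead_gb <= budget_gb]
    return max(fitting, key=lambda q: q[1]) if fitting else None
```

For example, `largest_fitting(QUANTS, 24.0)` selects `i1-Q5_K_M` under the default overhead, while a budget of 8.0 leaves nothing that fits.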

Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

[image: quant type comparison graph by ikawrakow]

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

## FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.

## Thanks

I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.

<!-- end -->
3
boto-27B.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:31390b210ac0b56f03403d7845356667837ef8caa3337097d52e8f0683e5b76b
size 6692471040
3
boto-27B.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:154ac051bfc21568c5117e1092abcf07db0305e363c7c665dc800b3694b5553b
size 6132433152
3
boto-27B.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e3a196129786188b91e438e89dfa3107138654d3644ce4d1e59c16748cc33fd2
size 9398878464
3
boto-27B.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:214b5f361ef3784977a4f481084387f02f21e64f18235002962c150bee7afac8
size 8652161280
3
boto-27B.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cfa9a0be023fce65e9d43febfbedfe5de3c7fc44f0a8ea89bac78bf1d4747b28
size 8399716608
3
boto-27B.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d42bbf8f05ead17389f69cea603ceffa202716356098ab8e37f866c484adf99a
size 7625867520
3
boto-27B.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ccf1e8dbe2dedc5d5a27e96442e7c9d1307b34385edfb273f66183eebaef65eb
size 12454830336
3
boto-27B.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:28dca6a87c167f9cb4d0fdea322f6f1ccd2355b895fc9bec790e9710181055f6
size 12169060608
3
boto-27B.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:072c75c53aa4e0a0353b723e507588b9684c2a9d9a8a431b0452fe6b03cc6155
size 11550630144
3
boto-27B.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c37db05a7c3e223eb0bcd0db91050f6458701446807371b02b858535797b2ed8
size 10750755072
3
boto-27B.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c996e1578d8aa864c726d612eed6e8db7ce20c67ce29be74530c0e6b5d6c150d
size 14814421248
3
boto-27B.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:755c018486db959aa6778ab7a84633799195dac7a3a25db8856fa34cca202c1f
size 10449576192
3
boto-27B.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ddec849346bcd8ab7b5062d67dc3b6b44d9471074f0e5c88ff83057274db1c07
size 14519361792
3
boto-27B.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8c80dd9a5b5b4cc164a23193877b419bb424e3e799fb90307e7a969506990a31
size 13424648448
3
boto-27B.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:890e928d2f5e5281178cddad8b0e9ed7e8c6bc0aa314a1295faafa46e513cbdb
size 12169060608
3
boto-27B.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:541f6812864c9f61b54490c35bf321515a35ebde50ce67c15cc636e2619dca65
size 15681462528
3
boto-27B.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b5bf7a7f94b488016532eb5eec8047f2b079941d17801f26b0939dc2329698e2
size 16645382400
3
boto-27B.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3ed298fc35f724d20c3929bb1afd6be6d535ce95d10311782a3eef83da642d46
size 15739265280
3
boto-27B.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6dab03c03e46b0ed0129247f6e1d64ea2da280e57a9c9b7c5175b1b22b8f7ba6
size 19408118016
3
boto-27B.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:669652121fca5779d7ebc5b1d18cdb3c3fbf108d265effb3afdb41966250849d
size 18884206848
3
boto-27B.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:61ea779dd4ad81656164daec947ca6586176590708557e1293a667024969cb08
size 22343524608
3
imatrix.dat
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5126208234232b738f3602a54fe2298ad329e304a8b3cbcb823732477d4df61a
size 11786683
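
Each pointer file above records the SHA-256 digest and byte size of the real object, so a downloaded file can be checked against it. The following is a minimal sketch under that assumption; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of Git LFS tooling, and `POINTER_TEXT` repeats the `imatrix.dat` pointer shown above.

```python
import hashlib

# Pointer contents for imatrix.dat, copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:5126208234232b738f3602a54fe2298ad329e304a8b3cbcb823732477d4df61a
size 11786683
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at path has the pointer's size and SHA-256."""
    hasher = hashlib.sha256()
    total = 0
    with open(path, "rb") as f:
        # Hash in chunks so large GGUF files are not loaded into memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]
```

Usage would be along the lines of `matches_pointer("imatrix.dat", parse_lfs_pointer(POINTER_TEXT))` after downloading the file.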