Initialize project; model provided by the ModelHub XC community
Model: mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF Source: Original Platform
60
.gitattributes
vendored
Normal file
@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
imatrix.dat filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_4_8.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_8_8.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c3612ce25555c09f75d5f4219a5b1e839ae5780ae2d51a7a9c5c5d49fcae569a
size 1053473600
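Each `.gguf` entry in this commit is a Git LFS pointer rather than the model data itself: three `key value` lines giving the spec version, the content hash, and the size in bytes. The following is a minimal sketch of reading such a pointer and checking downloaded bytes against it. The function names `parse_lfs_pointer` and `chunks_match_pointer` are hypothetical helpers written for this illustration, not part of any Git LFS tooling.

```python
import hashlib
from typing import Iterable

# Pointer text copied from the IQ1_M entry in this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:c3612ce25555c09f75d5f4219a5b1e839ae5780ae2d51a7a9c5c5d49fcae569a
size 1053473600
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each non-empty 'key value' line and unpack the oid into algorithm and digest."""
    fields = {}
    for line in text.splitlines():
        if line.strip():
            key, _, value = line.partition(" ")
            fields[key] = value.strip()
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def chunks_match_pointer(chunks: Iterable[bytes], pointer: dict) -> bool:
    """Hash the data chunk by chunk and compare both size and digest with the pointer."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    for chunk in chunks:
        hasher.update(chunk)
        total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])  # prints: sha256 1053473600
```

Taking an iterable of chunks keeps memory use flat for gigabyte-sized files; in practice the chunks could come from reading the downloaded file in fixed-size blocks.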
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8166feb1c8d6b9f89265ad9fe3fb77f9ebdbe13febc1be99dc2e789550918804
size 997440320
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0287ef4ee6ef03556af1a281cab0039d330a484d5e4c57043f356eb2072a0804
size 1398330176
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2185aca6c6944a1a091c07cc17a4f3c7f22c09921367a46255f4afd8217cb889
size 1323619136
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d734c29271e1f2e40447c03521698d520f3b9a314665027f512ef08211c4b3cf
size 1229830976
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1911b107871690f596452f5978a290d81aecd024f154bf7fb6309fa16209fcff
size 1146862400
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d4d10ba8810b32513c4dfca524aa3a615d9c2149359832325e5169d1f8704eb6
size 1768966976
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7bc25fe31b59532c3f1c018aefe7acd0d20266c517e9ffe002dd2886c26ec981
size 1712147264
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:598695581c571992f769e924b7e626f065c61119bc36d78260574b45d5991ebf
size 1646086976
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f353bc41be1d500347be9b899213ed31f2c7dbe9d520bac10530b28442872566
size 1518064448
3
Llama-3.2-3B-Instruct-uncensored.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:189c36ccdb8bde2fd5239ce07d0cb4bfa44ddf59d170ce990bf8214484216fd5
size 2038424384
3
Llama-3.2-3B-Instruct-uncensored.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fd0cb99489dcbbc805472e3b69cb92673d9cf31d0d1946fcbcafd89e4ee40a2c
size 1493218112
3
Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:818c29a960c5615525e8cb07c012c581553c21978b6f2d2125109067359daad6
size 1984645952
3
Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ba0f37ec8c4eb25bd109ba0823c738c7f359124c81503ddaedc9af7a1a063b01
size 1856457536
3
Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f56abf4807028992897aeb59fdfdec77a8f079d492cae641a1a775a6465ed646
size 1712147264
3
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fe1a16db3f4f687b2ebde22e42ee56d4c7191956da78a37e3c50dae6e1133754
size 2143535936
3
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_4_4.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2908a67337817c10589e136b82089284c73376e61c135face13b4d657969333f
size 2138817344
3
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_4_8.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ce16ffbea240204865243ef24f28caf1bdb9fbd280e81ccefff13f9fb5ef64b3
size 2138817344
3
Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_8_8.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e06886bbeb8d01f8b1b94399ddf1310535b5cd2636705eebbf2c8817dfb6f0c9
size 2138817344
3
Llama-3.2-3B-Instruct-uncensored.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a04467ef4f44d36883d1b48434edf1a0010a1b388548ed68648d5b50d6e0e5a3
size 2241004352
3
Llama-3.2-3B-Instruct-uncensored.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a9279e732aead43177e6cb96e6491542118c981da6475ddc06c2f4e972bfddfa
size 2149827392
3
Llama-3.2-3B-Instruct-uncensored.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:caa19deff94f68b78bba1079efc0df370f078704943b46f6b717cc5996c703f5
size 2593030976
3
Llama-3.2-3B-Instruct-uncensored.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:852ab503a74f7a9859f56b8e34b086dbf686b9e888cd24baaf2e5ff6d2ed77aa
size 2540389184
3
Llama-3.2-3B-Instruct-uncensored.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7020a5746cfe18154625fed8f6af3857dbb1a599b3ef420d32308df32251bd8c
size 2967059264
76
README.md
Normal file
@@ -0,0 +1,76 @@
---
base_model: chuanli11/Llama-3.2-3B-Instruct-uncensored
language:
- en
library_name: transformers
quantized_by: mradermacher
tags: []
---
## About

<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
weighted/imatrix quants of https://huggingface.co/chuanli11/Llama-3.2-3B-Instruct-uncensored

<!-- provided-files -->
static quants are available at https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-GGUF
## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including how to concatenate multi-part files.

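Every quant in this repository is a single file, and the download links in the Provided Quants table all follow the same `resolve/main` URL pattern. The sketch below rebuilds that URL from a quant type. The helper names `quant_filename` and `quant_url` and the `revision` parameter are assumptions made for illustration; only the repository name, model name, and URL layout come from this README.

```python
# Repository and model names as they appear in the links of this README.
REPO = "mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF"
MODEL = "Llama-3.2-3B-Instruct-uncensored"


def quant_filename(quant: str, model: str = MODEL) -> str:
    """Return the file name for a quant type such as 'Q4_K_M' or 'IQ3_XS'."""
    return f"{model}.i1-{quant}.gguf"


def quant_url(quant: str, repo: str = REPO, model: str = MODEL,
              revision: str = "main") -> str:
    """Build the direct download URL in the same shape as the table links."""
    return (
        f"https://huggingface.co/{repo}/resolve/{revision}/"
        f"{quant_filename(quant, model)}"
    )


print(quant_url("Q4_K_M"))
```

The resulting URL can be handed to any HTTP client or download tool; the file it fetches is the full GGUF, not the LFS pointer.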
## Provided Quants

(Sorted by size, not necessarily by quality. IQ-quants are often preferable to similarly sized non-IQ quants.)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ1_S.gguf) | i1-IQ1_S | 1.1 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ1_M.gguf) | i1-IQ1_M | 1.2 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 1.2 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ2_XS.gguf) | i1-IQ2_XS | 1.3 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ2_S.gguf) | i1-IQ2_S | 1.4 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ2_M.gguf) | i1-IQ2_M | 1.5 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q2_K.gguf) | i1-Q2_K | 1.6 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 1.6 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ3_XS.gguf) | i1-IQ3_XS | 1.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ3_S.gguf) | i1-IQ3_S | 1.8 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_S.gguf) | i1-Q3_K_S | 1.8 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ3_M.gguf) | i1-IQ3_M | 1.9 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_M.gguf) | i1-Q3_K_M | 2.0 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q3_K_L.gguf) | i1-Q3_K_L | 2.1 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-IQ4_XS.gguf) | i1-IQ4_XS | 2.1 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_4_4.gguf) | i1-Q4_0_4_4 | 2.2 | fast on arm, low quality |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_4_8.gguf) | i1-Q4_0_4_8 | 2.2 | fast on arm+i8mm, low quality |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q4_0_8_8.gguf) | i1-Q4_0_8_8 | 2.2 | fast on arm+sve, low quality |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q4_0.gguf) | i1-Q4_0 | 2.2 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q4_K_S.gguf) | i1-Q4_K_S | 2.2 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q4_K_M.gguf) | i1-Q4_K_M | 2.3 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q5_K_S.gguf) | i1-Q5_K_S | 2.6 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q5_K_M.gguf) | i1-Q5_K_M | 2.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-Instruct-uncensored-i1-GGUF/resolve/main/Llama-3.2-3B-Instruct-uncensored.i1-Q6_K.gguf) | i1-Q6_K | 3.1 | practically like static Q6_K |

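Because the table is sorted by size, one simple way to choose a file is to take the largest quant that fits a given byte budget. The sketch below does that using the exact byte sizes recorded in the LFS pointers of this commit. The function `largest_quant_within` and the budgets passed to it are hypothetical, and file size is only a rough proxy, since a runtime needs additional memory beyond the weights themselves.

```python
from typing import Optional

# Byte sizes copied from the "size" line of each LFS pointer in this commit.
SIZES_BYTES = {
    "IQ1_S": 997440320,
    "IQ1_M": 1053473600,
    "IQ2_XXS": 1146862400,
    "IQ2_XS": 1229830976,
    "IQ2_S": 1323619136,
    "IQ2_M": 1398330176,
    "Q2_K": 1493218112,
    "IQ3_XXS": 1518064448,
    "IQ3_XS": 1646086976,
    "IQ3_S": 1712147264,
    "Q3_K_S": 1712147264,
    "IQ3_M": 1768966976,
    "Q3_K_M": 1856457536,
    "Q3_K_L": 1984645952,
    "IQ4_XS": 2038424384,
    "Q4_0_4_4": 2138817344,
    "Q4_0_4_8": 2138817344,
    "Q4_0_8_8": 2138817344,
    "Q4_0": 2143535936,
    "Q4_K_S": 2149827392,
    "Q4_K_M": 2241004352,
    "Q5_K_S": 2540389184,
    "Q5_K_M": 2593030976,
    "Q6_K": 2967059264,
}


def largest_quant_within(budget_bytes: int,
                         sizes: dict = SIZES_BYTES) -> Optional[str]:
    """Return the quant with the largest file that still fits the budget, or None."""
    fitting = {quant: size for quant, size in sizes.items() if size <= budget_bytes}
    if not fitting:
        return None
    # When several quants share the same size, the first one listed wins.
    return max(fitting, key=fitting.get)


print(largest_quant_within(2_300_000_000))  # prints: Q4_K_M
```

Size alone ignores the quality notes in the table (for example, that an IQ quant is probably better than a similarly sized non-IQ one), so the result is best treated as a starting point rather than a recommendation.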
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

## FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for answers to
questions you might have, or to request that another model be quantized.

## Thanks

I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.

<!-- end -->
3
imatrix.dat
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:98ecc7432f990c56c6136d6cde375165fbc7136c12226002436b445a71f99443
size 2988377