Initialize project; model provided by the ModelHub XC community
Model: mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF Source: Original Platform
60
.gitattributes
vendored
Normal file
@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama3.2-1B-STEM-Full.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
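
The `.gitattributes` entries above route matching paths through Git LFS. Note that there is no `*.gguf` wildcard: each GGUF file is listed by its exact name. As a rough illustration only, the sketch below approximates the simple basename globs with Python's `fnmatch`. Git itself evaluates these patterns with gitignore-style rules, so this is an assumption-laden approximation, and `LFS_PATTERNS` (a small subset of the entries) and `is_lfs_tracked` are hypothetical names introduced for this example.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# A small subset of the patterns from the .gitattributes file (illustrative only).
LFS_PATTERNS = [
    "*.bin",
    "*.safetensors",
    "*.zip",
    "*tfevents*",
    "Llama3.2-1B-STEM-Full.i1-Q4_K_M.gguf",
]


def is_lfs_tracked(path, patterns=LFS_PATTERNS):
    """Approximate check: does the file's basename match any LFS pattern?

    This ignores directory-aware patterns such as saved_model/**/* and is
    not a substitute for `git check-attr`.
    """
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in patterns)


if __name__ == "__main__":
    for candidate in ["Llama3.2-1B-STEM-Full.i1-Q4_K_M.gguf", "other.gguf", "README.md"]:
        print(candidate, is_lfs_tracked(candidate))
```

With only explicit file names listed, a new GGUF file added later would not be stored in LFS unless its name (or a `*.gguf` pattern) were added as well.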
3
Llama3.2-1B-STEM-Full.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:74c7ba38e2fc031d59b552c16adae2f85079a3648e28341fecb746fc5a5767b7
size 413607616
3
Llama3.2-1B-STEM-Full.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2e83eba85b05c094b952da116987ccfa1cf7dc52ceb1e4f82d3b25c1795dea3d
size 393553600
3
Llama3.2-1B-STEM-Full.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bcd29730d890bf2e555b0f520dbb9c5490c11ca0efd23934ccc216520d08b78d
size 515450560
3
Llama3.2-1B-STEM-Full.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1a93c144b7806b2fe3456eff3516176ebc3a268916f21c9e8ba35eb6b74e7fe3
size 488711872
3
Llama3.2-1B-STEM-Full.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f0a71c780578ec0e97571043b7df85241b4dec065c3fd418f437dbc719ff25e4
size 475866816
3
Llama3.2-1B-STEM-Full.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bdd837d88452d8e3382f52bb2d1e3d9f49d39ac4949bce49e2f2ace99236e1ca
size 447030976
3
Llama3.2-1B-STEM-Full.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8a229d0cdf3ef36a6f51b55c42911215342549bd5d264928e3668fae25b53998
size 657290944
3
Llama3.2-1B-STEM-Full.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b611ca275acd0d28af9ef78c06f2b28eebed3659d9c5445f1a19b182ef9b5afa
size 643921600
3
Llama3.2-1B-STEM-Full.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2894c804c3b8987b9fd76f4c4ab705d9833d05f11734c0116a9cc826459786f0
size 621115072
3
Llama3.2-1B-STEM-Full.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:564cf5914d1eb45b4fd1a0fda19295ea4d39d6b461f9049c56c4bd575e1aaff2
size 562112192
3
Llama3.2-1B-STEM-Full.i1-IQ4_NL.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:df28c7e82b925f0a1d52e2cb1f33b2abd39c980cbea4461b54c685bd4c07fa74
size 773027520
3
Llama3.2-1B-STEM-Full.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b98328b2ce4761dff402a12fb02d06d5a0e4053a040363a9f56096e6b8765a3b
size 743143104
3
Llama3.2-1B-STEM-Full.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0c8f055737297d6a18c841ea2487af95e661228113d04f4bceded9664f82056a
size 580875968
3
Llama3.2-1B-STEM-Full.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1d3ebe13d588dcf5009c04874922892d82b21bf34b995aa35df681c7d573df7d
size 554661568
3
Llama3.2-1B-STEM-Full.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:15cf720f4998159cfc3526e5c4c94fb144a8acf24807910c1b367ecba6923817
size 732526272
3
Llama3.2-1B-STEM-Full.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:025d4eb734c9ec959b675b3cb67e0253bce64ab6cd67d3d36696fe780daf528d
size 690845376
3
Llama3.2-1B-STEM-Full.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:088b6a8c72d6f1dd1af51637a6bae3c3e34bd151c19bd8f5081a5f857f13b104
size 641693376
3
Llama3.2-1B-STEM-Full.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8b31ca129f5971231d4222a02f74585e2ac35481586402f0e415c1ea776bcaf8
size 773027520
3
Llama3.2-1B-STEM-Full.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:78e1e833b06980e750beb6a946ebde9fecbcfc95f3642cb5df8560c1fa578416
size 831747776
3
Llama3.2-1B-STEM-Full.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:106e9904b0afe17a80c755547afa99dc786d45a5c33617ada3edd9f0829c8035
size 807696064
3
Llama3.2-1B-STEM-Full.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:77b3a289a02ed9aa7357bcb10e04b178cd34f9d3a91273a443d2e926d597e428
size 775648960
3
Llama3.2-1B-STEM-Full.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5afea7c8741e1cbed23c73dc6cfba40492a135c0feda2e9a1e8aad4909ad5f94
size 911505088
3
Llama3.2-1B-STEM-Full.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:22c8677a42b22c9e9cc154839583876af4b0471bd409aac909ce7d7295019738
size 892565184
3
Llama3.2-1B-STEM-Full.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6ecd817b6066a2308c68f6a834c0cb939174e9d5666079b75fef75978c630857
size 1021802176
3
Llama3.2-1B-STEM-Full.imatrix.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:be4dc552561ac02b29bca6922baad4c5926117488ec8f9e41b129670df645512
size 1328000
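
Each GGUF entry in this commit is a three-line Git LFS pointer (`version`, `oid`, `size`) rather than the model data itself. The following is a minimal sketch, assuming you have already downloaded the real file by some other means, of how the pointer fields could be used to check a download. `parse_lfs_pointer` and `verify_file` are hypothetical helper names, not part of Git LFS; the pointer text is the one shown above for the imatrix file.

```python
import hashlib

# Pointer contents copied from the imatrix file entry in this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:be4dc552561ac02b29bca6922baad4c5926117488ec8f9e41b129670df645512
size 1328000
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its version, hash algorithm, digest, and size."""
    fields = dict(
        line.split(" ", 1) for line in text.strip().splitlines() if line.strip()
    )
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer, chunk_size=1 << 20):
    """Return True if the file's byte count and hash match the pointer."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so large GGUF files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER_TEXT))
```

Checking the size first is cheap, but only the hash comparison confirms that the contents are intact.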
92
README.md
Normal file
@@ -0,0 +1,92 @@
---
base_model: theprint/Llama3.2-1B-STEM-Full
language:
- en
library_name: transformers
license: apache-2.0
mradermacher:
  readme_rev: 1
quantized_by: mradermacher
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
- sft
---
## About

<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
<!-- ### quants_skip: -->
<!-- ### skip_mmproj: -->
weighted/imatrix quants of https://huggingface.co/theprint/Llama3.2-1B-STEM-Full

<!-- provided-files -->

***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Llama3.2-1B-STEM-Full-i1-GGUF).***

static quants are available at https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-GGUF
## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including how to concatenate multi-part files.
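
As a small illustration of fetching one of the files, the sketch below builds a direct download URL for a given quant type. It assumes the `resolve/main` URL layout and the `<model>.i1-<quant>.gguf` naming used by the links in this README; `quant_url` is a hypothetical helper, and the actual download (for example with `urllib.request.urlretrieve`) is left as a comment because it needs network access.

```python
from urllib.parse import quote

# Repository and base model name as they appear in this README.
REPO = "mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF"
MODEL = "Llama3.2-1B-STEM-Full"


def quant_url(quant, repo=REPO, model=MODEL, revision="main"):
    """Build the download URL for one weighted/imatrix quant, e.g. 'Q4_K_M'."""
    filename = f"{model}.i1-{quant}.gguf"
    return f"https://huggingface.co/{repo}/resolve/{revision}/{quote(filename)}"


if __name__ == "__main__":
    url = quant_url("Q4_K_M")
    print(url)
    # To download (requires network access):
    # import urllib.request
    # urllib.request.urlretrieve(url, "Llama3.2-1B-STEM-Full.i1-Q4_K_M.gguf")
```

The imatrix file itself follows a different naming scheme (`Llama3.2-1B-STEM-Full.imatrix.gguf`), so this helper does not cover it.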

## Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similarly sized non-IQ quants)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ1_S.gguf) | i1-IQ1_S | 0.5 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ1_M.gguf) | i1-IQ1_M | 0.5 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.5 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.6 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ2_S.gguf) | i1-IQ2_S | 0.6 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ2_M.gguf) | i1-IQ2_M | 0.6 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.7 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.7 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q2_K.gguf) | i1-Q2_K | 0.7 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.7 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ3_S.gguf) | i1-IQ3_S | 0.7 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ3_M.gguf) | i1-IQ3_M | 0.8 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.8 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.8 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.8 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.9 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q4_0.gguf) | i1-Q4_0 | 0.9 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.9 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.9 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q4_1.gguf) | i1-Q4_1 | 0.9 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q5_K_S.gguf) | i1-Q5_K_S | 1.0 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q5_K_M.gguf) | i1-Q5_K_M | 1.0 |  |
| [GGUF](https://huggingface.co/mradermacher/Llama3.2-1B-STEM-Full-i1-GGUF/resolve/main/Llama3.2-1B-STEM-Full.i1-Q6_K.gguf) | i1-Q6_K | 1.1 | practically like static Q6_K |

Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

## FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.

## Thanks

I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.

<!-- end -->