Initialize project; model provided by the ModelHub XC community

Model: mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF
Source: Original Platform
ModelHub XC
2026-04-21 17:00:38 +08:00
commit ca7a2024d2
27 changed files with 217 additions and 0 deletions

.gitattributes (vendored, normal file, 60 lines added)

@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
imatrix.dat filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
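
Each rule in the `.gitattributes` file above has the same shape: a path pattern followed by attribute tokens (`filter=lfs diff=lfs merge=lfs -text`). As a rough illustration of how those tokens read, here is a minimal Python sketch that splits one rule into its pattern and attributes. The function names are hypothetical, and the parser ignores quoted patterns containing spaces; it is not how Git itself parses the file.

```python
def parse_gitattributes_line(line):
    """Split one .gitattributes rule into (pattern, attributes).

    Returns None for blank lines and comments. Quoted patterns that
    contain spaces are not handled in this sketch.
    """
    line = line.strip()
    if not line or line.startswith("#"):
        return None
    pattern, *tokens = line.split()
    attrs = {}
    for token in tokens:
        if token.startswith("-"):
            attrs[token[1:]] = False   # attribute explicitly unset, e.g. -text
        elif token.startswith("!"):
            attrs[token[1:]] = None    # attribute reset to "unspecified"
        elif "=" in token:
            key, value = token.split("=", 1)
            attrs[key] = value         # attribute set to a value, e.g. filter=lfs
        else:
            attrs[token] = True        # attribute set with no value
    return pattern, attrs


def is_lfs_rule(attrs):
    """A rule routes files through Git LFS when its filter is 'lfs'."""
    return attrs.get("filter") == "lfs"


pattern, attrs = parse_gitattributes_line(
    "*.safetensors filter=lfs diff=lfs merge=lfs -text"
)
print(pattern, attrs, is_lfs_rule(attrs))
```

The `-text` token is what tells Git to treat matching files as binary, so no line-ending conversion is applied to them.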

README.md (normal file, 82 lines added)

@@ -0,0 +1,82 @@
---
base_model: EpistemeAI/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus
language:
- en
library_name: transformers
license: apache-2.0
quantized_by: mradermacher
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
---
## About
<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
weighted/imatrix quants of https://huggingface.co/EpistemeAI/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus
<!-- provided-files -->
static quants are available at https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-GGUF
## Usage
If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including how to concatenate multi-part files.
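
None of the files listed for this repository are split, but for repositories that do ship multi-part files, concatenation can be sketched in Python as below. This assumes the parts are plain byte-wise splits of one GGUF file; the function name and the `.part1of2` style file names are illustrative assumptions, not something this README specifies.

```python
import shutil
import tempfile
from pathlib import Path


def concatenate_parts(part_paths, output_path):
    """Join split files byte-for-byte, in exactly the order given."""
    with open(output_path, "wb") as out:
        for part in part_paths:
            with open(part, "rb") as src:
                # Stream in chunks so large parts are never fully in memory.
                shutil.copyfileobj(src, out)
    return Path(output_path)


# Demo with throwaway files standing in for real parts (hypothetical names).
with tempfile.TemporaryDirectory() as tmp:
    tmp = Path(tmp)
    parts = [tmp / "model.gguf.part1of2", tmp / "model.gguf.part2of2"]
    parts[0].write_bytes(b"first-half;")
    parts[1].write_bytes(b"second-half")
    merged = concatenate_parts(parts, tmp / "model.gguf")
    print(merged.read_bytes())
```

The parts are passed as an explicit list rather than discovered by sorting file names, because a plain lexicographic sort would put `part10` before `part2`.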
## Provided Quants
(sorted by size, not necessarily by quality; IQ quants are often preferable to similarly sized non-IQ quants)
| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ1_S.gguf) | i1-IQ1_S | 0.5 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ1_M.gguf) | i1-IQ1_M | 0.5 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.5 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.6 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_S.gguf) | i1-IQ2_S | 0.6 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ2_M.gguf) | i1-IQ2_M | 0.6 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.7 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.7 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q2_K.gguf) | i1-Q2_K | 0.7 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.7 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.7 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_S.gguf) | i1-IQ3_S | 0.7 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ3_M.gguf) | i1-IQ3_M | 0.8 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.8 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.8 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.8 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.9 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_0.gguf) | i1-Q4_0 | 0.9 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.9 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.9 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q4_1.gguf) | i1-Q4_1 | 0.9 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q5_K_S.gguf) | i1-Q5_K_S | 1.0 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q5_K_M.gguf) | i1-Q5_K_M | 1.0 | |
| [GGUF](https://huggingface.co/mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF/resolve/main/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus.i1-Q6_K.gguf) | i1-Q6_K | 1.1 | practically like static Q6_K |
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):
![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
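
The download links in the table above all follow one pattern: the repository name, `resolve/main`, then the model name with the quant type as a suffix. A small Python sketch of that pattern, with a hypothetical helper name, could look like this:

```python
REPO = "mradermacher/Reasoning-Llama-3.2-1B-Instruct-v1.3-plus-i1-GGUF"
MODEL = "Reasoning-Llama-3.2-1B-Instruct-v1.3-plus"


def quant_url(quant_type, repo=REPO, model=MODEL):
    """Build the download URL for a quant type from the table, e.g. 'i1-Q4_K_M'."""
    return f"https://huggingface.co/{repo}/resolve/main/{model}.{quant_type}.gguf"


print(quant_url("i1-Q4_K_M"))
```

The helper only formats a string; it does not check that the quant type actually exists in the repository.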
## FAQ / Model Request
See https://huggingface.co/mradermacher/model_requests for answers to
questions you might have, or to request that another model be quantized.
## Thanks
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
<!-- end -->

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0d00764b34db4e603397e913f185ee2e540786a155449259021768fe80f36022
size 413607712

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:726cd0e6950bf5fda7e38fbb67303f0a0fc629fb237780e5990832a00edaef57
size 393553696

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c3629758daf263c7b8731e49c47571424af2c013eb015f28701c4e5327119d76
size 515450656

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b4276a27f27a55e80f10b6686ba767b148ca7f001ec1abf6bce1ff22c95e6e14
size 488711968

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a1d2567096517326a0d44d656630c7d3e06570be541074f312a00da733a269c9
size 475866912

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c976e6b722f046d76ed8398a120e2b2c0548d46a1f9aa02a9cd5b3b980797622
size 447031072

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:646c74a5c2174cb70d3d9d67f0e3a791f1308043817285b7587cc79cc7675370
size 657291040

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3fc1a812511169a09528875228a9ec357e9f8889ba8f60be64dc84aa43d88986
size 643921696

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:99a75d2dd44b09299027518b0e70cad1b14d72943ce10117c8b4a7855c0d7976
size 621115168

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ed79f5746e2c2f45064473fc1223c072940b29dc4b91e186f8af4350710bedbd
size 562112288

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:63213ff2a73cbde2380d9c8a05849dfa2891b012beee47d507f6882b9955d909
size 773027616

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:24ae5fcecf30285b8aa4591c7ef9783f8b68e993090fd7706d8e048914645989
size 743143200

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a955ce8ddbcc322d0abeb6c0c9b3d706f9de6b6f96cd0284ff3c8204eba862e7
size 580876064

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6f47ee9facf4052f60cbd498f5ca3f73276bca7a82059cfe4d0168f5f3c1e8d1
size 554661664

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:273831958d55765606941bc9958ab5d509dd90e630feb9516b5949f46748339c
size 732526368

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b78ba8ec84e23af894aa626cd390a3902894f9643a001c403d9e215c68b8d458
size 690845472

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:350e051bbe914fef9cbe7a561929247d2f95be1d4b245559d701b075e51149f4
size 641693472

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e08dbeaf46ca3d5cf90e79cbb5b75c4ff1f3c641167f47080f91541f04bfd626
size 773027616

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b96d7b8b059655908b08757b098d1fdcada5ac43043e367a851dd855d0b94744
size 831747872

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d8c9bd04acbcaee1f25eceec9eb8c1e4ec01cbf5a54ce718a00ce09e45298ee3
size 807696160

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9059258ab51832303988cc8bb867082510c757bed15911f77b4a927575917f4c
size 775649056

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5c6aa5045c9bd7cd45b882a0126b61d9963e906448c728a221659c74a2e3c09a
size 911505184

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9b232c88dff8fdb1be35580b6e22d0d16c7571fa74f5e5ec1cdceeeba088fc6f
size 892565280

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c7d151eaa93f71a630fb270a5290cc296a557581783a8681107dd63231333155
size 1021802272

imatrix.dat (normal file, 3 lines added)

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:72c9309f1c90fbc7a99e7d8b472a420c7b800539d1b448b9dbbfc5d01b9c4db7
size 1314413
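
Every three-line file in this commit is a Git LFS pointer: a spec `version`, an `oid` giving the hash algorithm and digest of the real content, and its `size` in bytes. The following Python sketch shows how such a pointer could be parsed and used to check downloaded bytes. The function names are hypothetical, and the check reads the data from memory for brevity; for files of several hundred megabytes, a streaming hash would be the better choice.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Check that raw bytes have the size and digest recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# The imatrix.dat pointer from this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:72c9309f1c90fbc7a99e7d8b472a420c7b800539d1b448b9dbbfc5d01b9c4db7\n"
    "size 1314413\n"
)
print(pointer["algo"], pointer["size"])
```

Comparing the size first is a cheap way to reject truncated downloads before hashing.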