初始化项目,由ModelHub XC社区提供模型

Model: mradermacher/aquif-3.5-A4B-Think-i1-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-05 19:36:58 +08:00
commit e5a3d719a5
27 changed files with 237 additions and 0 deletions

60
.gitattributes vendored Normal file
View File

@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
aquif-3.5-A4B-Think.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text

102
README.md Normal file
View File

@@ -0,0 +1,102 @@
---
base_model: aquif-ai/aquif-3.5-A4B-Think
language:
- en
- de
- it
- pt
- fr
- hi
- es
- th
- zh
- ja
library_name: transformers
license: apache-2.0
mradermacher:
readme_rev: 1
quantized_by: mradermacher
tags:
- language
- aquif
- text-generation-inference
- math
- coding
- small
- aquif-3.5
---
## About
<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
<!-- ### quants_skip: -->
<!-- ### skip_mmproj: -->
weighted/imatrix quants of https://huggingface.co/aquif-ai/aquif-3.5-A4B-Think
<!-- provided-files -->
***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#aquif-3.5-A4B-Think-i1-GGUF).***
static quants are available at https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-GGUF
## Usage
If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including on how to concatenate multi-part files.
## Provided Quants
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ1_S.gguf) | i1-IQ1_S | 2.8 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ1_M.gguf) | i1-IQ1_M | 3.0 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 3.4 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ2_XS.gguf) | i1-IQ2_XS | 3.8 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ2_S.gguf) | i1-IQ2_S | 3.9 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ2_M.gguf) | i1-IQ2_M | 4.2 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q2_K_S.gguf) | i1-Q2_K_S | 4.4 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q2_K.gguf) | i1-Q2_K | 4.7 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 4.9 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ3_XS.gguf) | i1-IQ3_XS | 5.2 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q3_K_S.gguf) | i1-Q3_K_S | 5.5 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ3_S.gguf) | i1-IQ3_S | 5.5 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ3_M.gguf) | i1-IQ3_M | 5.6 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q3_K_M.gguf) | i1-Q3_K_M | 6.0 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q3_K_L.gguf) | i1-Q3_K_L | 6.5 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ4_XS.gguf) | i1-IQ4_XS | 6.7 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-IQ4_NL.gguf) | i1-IQ4_NL | 7.0 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q4_0.gguf) | i1-Q4_0 | 7.0 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q4_K_S.gguf) | i1-Q4_K_S | 7.1 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q4_K_M.gguf) | i1-Q4_K_M | 7.5 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q4_1.gguf) | i1-Q4_1 | 7.7 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q5_K_S.gguf) | i1-Q5_K_S | 8.5 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q5_K_M.gguf) | i1-Q5_K_M | 8.7 | |
| [GGUF](https://huggingface.co/mradermacher/aquif-3.5-A4B-Think-i1-GGUF/resolve/main/aquif-3.5-A4B-Think.i1-Q6_K.gguf) | i1-Q6_K | 10.0 | practically like static Q6_K |
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):
![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
## FAQ / Model Request
See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.
## Thanks
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
<!-- end -->

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e049a959dc6ed89058e6089a9def313acc734bae71e7bfe68de46ffb714741a4
size 2926231296

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0af2d912d29497f34ae3cda490d72869036899d396276b7e0f041a0b9a6500fe
size 2672361216

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bfbfecdea28c22043a87fe63a7ed80012b35ee57edee9cef4379951125b6fe4c
size 4131684096

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7686ab3898e0878d06eb304cee550854ea2a824b198798f680c1ac58f70dc700
size 3793190656

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f9e295251b53320b6ac02effcd0f43e3fa3d5d2d26deadc93ed761de6f68eeff
size 3699638016

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b89cfdea4926c03f2110cc0b24bb479fb8ce391f638333e6da13d03475b63d5e
size 3349348096

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8d87f19825e40d0b205cc60c8af494b699e5b6dcf63dd30904ca706707110114
size 5471124736

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1b8d1c82ba328d5c3b6cbf733bf7473b697fba913b92c8f8092ed9e6527772dc
size 5368069376

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e89812929f4989daa4068c2dcd852b7ec1fc607ceee42d7a9102d3dcf888f095
size 5093801216

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2534fb40a71557eb15084da3da27daa1f279f872d3ebc7c13ea3d2fd1b1a08c1
size 4760502016

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1384d01f30e142df82c28bd611dea8269cc6c60e9e3fbb7d7b90aa5e0e1c2b41
size 6921518336

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c823d61b9d0a70e79d8f65d60d64d60a85b34f45088105d4c569987d4449de81
size 6558776576

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e67bd52cc11e5f8b9c6516fc66e0b3fe5a64095e7c0a2950ef5bf608297ba52b
size 4591713536

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7e48ec9f83e1ad70db8d1ca879e3a85fe706c57d83a96c4b29ccb604ce621d5d
size 4282547456

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dd82a010e856ee757e010afb4696dd80b03cf65ba0d849565f15640374810d7e
size 6401735936

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ba173392fe8a9a5b3cfe4343c72d6432ee500ffb8d6d3d7e799ee4f47e705f7c
size 5920046336

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bbdca5e85bc1b6d6cd774801442ed15b4748f049f21544ef94f960ed89385c48
size 5355535616

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c139f9079e117657d1e31b767a76c6c823dff41e2975ad95df7cb637dd86843c
size 6934625536

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1c5863b1c1b27ffe549430fe5d8d5a367181a860bda8f2bceaaffffd7d712394
size 7641103616

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dfa931a5526c9124e21c7d12d1832f1b0ca8db3e2b8adcb53085878daf9b8132
size 7384161536

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:60d29173710c9896fc1c7f5ca770000aedbe3da4903612f059e8ecbc19f4576c
size 6960839936

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:901dd82a459787908b2b18b880d8f81e7305f9cd11e2796f7472428b8c87df28
size 8616893696

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0d4048db7e346fccc4a999362e986cc68f8ecfdd282f995afa4faf05f26de8ed
size 8372485376

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:821d0f62ea78efed615f48974b632a4247886f39cc5dbea97e4efaf9d295b9a0
size 9926671616

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d9fbd216cfed4c3f4ea559d3566b4ac3387f292a8507bf92a694f0c722181a0d
size 10664256