初始化项目,由ModelHub XC社区提供模型

Model: mradermacher/AceInstruct-7B-i1-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-09 02:05:06 +08:00
commit f69d8829a3
26 changed files with 215 additions and 0 deletions

59
.gitattributes vendored Normal file
View File

@@ -0,0 +1,59 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
AceInstruct-7B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e7600d2b66a19a509c5c7777c84efbb90e18df6b1010412e5bd82475765b298e
size 2042194464

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:11a0c8e62c14ff9c1a4ee6c4528a2504bc02b0591a9c38167866057af702494e
size 1903665696

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:983d5eb973b4c6089e568b91170af08e48a8e753306ae4493deb7826ea400e4c
size 2780340768

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:617d50bdec9ba7ed11aff79291d2f9f1380a6284692452428c2d745798151087
size 2595635744

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5402831062b80080c9a6d774039d6e8a1ff7ca443d2d7f83b819375af96f7aa8
size 2469020192

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4eb139636e78dea1c10efde4c06e5cd92bc074c0d45bcfca0830d1786b52dc34
size 2273075744

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:65b668a7d456b33baab1923d5cc0807c3cdf9fba35c32dd49c4657755a288760
size 3574010400

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0aaa5f1f0cdc4719bdb5c90f51217e1e5b24a2a73b42a790b9f927cd2b67e68e
size 3499190816

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:79d99bdb88e92aefc2cdce234882d7184df04bcd5f6a63ae00e9572ea5910f4c
size 3346254368

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:565c365df847b378ccee1553440fec06eeed79d256c440fd0772d5e88b936cb2
size 3114512928

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f332cdd8887beae4e86fb052639797707774ecd947c791ee33d69c4f293ab69d
size 4437811744

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a78615fc9b14f8025f17e27634e9f0471053da4c6ac4fe7b9b5936f77899f41a
size 4218470944

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1dde90421493e078a31590f7334d527daf401ab59799b0ebd2332c393ea15b43
size 3015938592

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b4da5958a2a5eeb41e28b9bbfa04b233c167885bb040cb8abe01b3449ec9df71
size 2834072096

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:51b4ff2cb51d100371e6e7cc226231d5e7bb2fac2c21c7f8e755000679653619
size 4088457760

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3a1e144537e7ec8df25e6f86a816727dc308f939679dfa24c6bf85edaaf128db
size 3808389664

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6563fb4d3a3810760506fe36a3aa781ae8adbba13a3f7de8b6be226edc95629a
size 3492366880

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:42cdfbd62831974f1541014cce6e2cabd63f33100a31deeafd0aa12e23724805
size 4444119584

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1ea51bdf8981a7f413c916cc622adf123e11ef2b137cb7e4092f430d1b27ba1b
size 4873282080

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dbc57cfa18e5722ccb29624ff7d02680aff570e160db8839facc5b37b0741895
size 4683072032

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4a7fd501ba6ffd6a7cd359ef3c0bcff79bb0ee29a98a86fc25632a1b8b4a5238
size 4457767456

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:72a8997943d0ea0018e6b94e7c889f3dada9ad4bbc9d20d332ed1d9563f95665
size 5444829728

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6b946f099d4db117b3d7b58552c828f38fcc3389b460deb47d5997c3da46f51a
size 5315174944

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2fc9dc3dfe46a16a6f8544ea5527a949f1087ea617255adeb81c5ded60cc9eb1
size 6254197280

84
README.md Normal file
View File

@@ -0,0 +1,84 @@
---
base_model: nvidia/AceInstruct-7B
language:
- en
library_name: transformers
license: cc-by-nc-4.0
quantized_by: mradermacher
tags:
- nvidia
- AceInstruct
- code
- math
- general_domain
- instruct_model
- pytorch
---
## About
<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
weighted/imatrix quants of https://huggingface.co/nvidia/AceInstruct-7B
<!-- provided-files -->
static quants are available at https://huggingface.co/mradermacher/AceInstruct-7B-GGUF
## Usage
If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including on how to concatenate multi-part files.
## Provided Quants
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ1_S.gguf) | i1-IQ1_S | 2.0 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ1_M.gguf) | i1-IQ1_M | 2.1 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.4 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.6 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_S.gguf) | i1-IQ2_S | 2.7 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_M.gguf) | i1-IQ2_M | 2.9 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.9 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q2_K.gguf) | i1-Q2_K | 3.1 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.2 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.4 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.6 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_S.gguf) | i1-IQ3_S | 3.6 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_M.gguf) | i1-IQ3_M | 3.7 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.9 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.2 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.3 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.5 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_0.gguf) | i1-Q4_0 | 4.5 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.6 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.8 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_1.gguf) | i1-Q4_1 | 5.0 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.4 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.5 | |
| [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q6_K.gguf) | i1-Q6_K | 6.4 | practically like static Q6_K |
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):
![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
## FAQ / Model Request
See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.
## Thanks
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
<!-- end -->