Initialize project; model provided by the ModelHub XC community

Model: mradermacher/AceInstruct-7B-i1-GGUF Source: Original Platform
2026-05-09 02:05:06 +08:00
commit f69d8829a3
26 changed files with 215 additions and 0 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -0,0 +1,59 @@
 *.7z filter=lfs diff=lfs merge=lfs -text
 *.arrow filter=lfs diff=lfs merge=lfs -text
 *.bin filter=lfs diff=lfs merge=lfs -text
 *.bz2 filter=lfs diff=lfs merge=lfs -text
 *.ckpt filter=lfs diff=lfs merge=lfs -text
 *.ftz filter=lfs diff=lfs merge=lfs -text
 *.gz filter=lfs diff=lfs merge=lfs -text
 *.h5 filter=lfs diff=lfs merge=lfs -text
 *.joblib filter=lfs diff=lfs merge=lfs -text
 *.lfs.* filter=lfs diff=lfs merge=lfs -text
 *.mlmodel filter=lfs diff=lfs merge=lfs -text
 *.model filter=lfs diff=lfs merge=lfs -text
 *.msgpack filter=lfs diff=lfs merge=lfs -text
 *.npy filter=lfs diff=lfs merge=lfs -text
 *.npz filter=lfs diff=lfs merge=lfs -text
 *.onnx filter=lfs diff=lfs merge=lfs -text
 *.ot filter=lfs diff=lfs merge=lfs -text
 *.parquet filter=lfs diff=lfs merge=lfs -text
 *.pb filter=lfs diff=lfs merge=lfs -text
 *.pickle filter=lfs diff=lfs merge=lfs -text
 *.pkl filter=lfs diff=lfs merge=lfs -text
 *.pt filter=lfs diff=lfs merge=lfs -text
 *.pth filter=lfs diff=lfs merge=lfs -text
 *.rar filter=lfs diff=lfs merge=lfs -text
 *.safetensors filter=lfs diff=lfs merge=lfs -text
 saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.tar.* filter=lfs diff=lfs merge=lfs -text
 *.tar filter=lfs diff=lfs merge=lfs -text
 *.tflite filter=lfs diff=lfs merge=lfs -text
 *.tgz filter=lfs diff=lfs merge=lfs -text
 *.wasm filter=lfs diff=lfs merge=lfs -text
 *.xz filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
 AceInstruct-7B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
--- a/AceInstruct-7B.i1-IQ1_M.gguf
+++ b/AceInstruct-7B.i1-IQ1_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:e7600d2b66a19a509c5c7777c84efbb90e18df6b1010412e5bd82475765b298e
 size 2042194464
--- a/AceInstruct-7B.i1-IQ1_S.gguf
+++ b/AceInstruct-7B.i1-IQ1_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:11a0c8e62c14ff9c1a4ee6c4528a2504bc02b0591a9c38167866057af702494e
 size 1903665696
--- a/AceInstruct-7B.i1-IQ2_M.gguf
+++ b/AceInstruct-7B.i1-IQ2_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:983d5eb973b4c6089e568b91170af08e48a8e753306ae4493deb7826ea400e4c
 size 2780340768
--- a/AceInstruct-7B.i1-IQ2_S.gguf
+++ b/AceInstruct-7B.i1-IQ2_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:617d50bdec9ba7ed11aff79291d2f9f1380a6284692452428c2d745798151087
 size 2595635744
--- a/AceInstruct-7B.i1-IQ2_XS.gguf
+++ b/AceInstruct-7B.i1-IQ2_XS.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:5402831062b80080c9a6d774039d6e8a1ff7ca443d2d7f83b819375af96f7aa8
 size 2469020192
--- a/AceInstruct-7B.i1-IQ2_XXS.gguf
+++ b/AceInstruct-7B.i1-IQ2_XXS.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:4eb139636e78dea1c10efde4c06e5cd92bc074c0d45bcfca0830d1786b52dc34
 size 2273075744
--- a/AceInstruct-7B.i1-IQ3_M.gguf
+++ b/AceInstruct-7B.i1-IQ3_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:65b668a7d456b33baab1923d5cc0807c3cdf9fba35c32dd49c4657755a288760
 size 3574010400
--- a/AceInstruct-7B.i1-IQ3_S.gguf
+++ b/AceInstruct-7B.i1-IQ3_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:0aaa5f1f0cdc4719bdb5c90f51217e1e5b24a2a73b42a790b9f927cd2b67e68e
 size 3499190816
--- a/AceInstruct-7B.i1-IQ3_XS.gguf
+++ b/AceInstruct-7B.i1-IQ3_XS.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:79d99bdb88e92aefc2cdce234882d7184df04bcd5f6a63ae00e9572ea5910f4c
 size 3346254368
--- a/AceInstruct-7B.i1-IQ3_XXS.gguf
+++ b/AceInstruct-7B.i1-IQ3_XXS.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:565c365df847b378ccee1553440fec06eeed79d256c440fd0772d5e88b936cb2
 size 3114512928
--- a/AceInstruct-7B.i1-IQ4_NL.gguf
+++ b/AceInstruct-7B.i1-IQ4_NL.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:f332cdd8887beae4e86fb052639797707774ecd947c791ee33d69c4f293ab69d
 size 4437811744
--- a/AceInstruct-7B.i1-IQ4_XS.gguf
+++ b/AceInstruct-7B.i1-IQ4_XS.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:a78615fc9b14f8025f17e27634e9f0471053da4c6ac4fe7b9b5936f77899f41a
 size 4218470944
--- a/AceInstruct-7B.i1-Q2_K.gguf
+++ b/AceInstruct-7B.i1-Q2_K.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:1dde90421493e078a31590f7334d527daf401ab59799b0ebd2332c393ea15b43
 size 3015938592
--- a/AceInstruct-7B.i1-Q2_K_S.gguf
+++ b/AceInstruct-7B.i1-Q2_K_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:b4da5958a2a5eeb41e28b9bbfa04b233c167885bb040cb8abe01b3449ec9df71
 size 2834072096
--- a/AceInstruct-7B.i1-Q3_K_L.gguf
+++ b/AceInstruct-7B.i1-Q3_K_L.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:51b4ff2cb51d100371e6e7cc226231d5e7bb2fac2c21c7f8e755000679653619
 size 4088457760
--- a/AceInstruct-7B.i1-Q3_K_M.gguf
+++ b/AceInstruct-7B.i1-Q3_K_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:3a1e144537e7ec8df25e6f86a816727dc308f939679dfa24c6bf85edaaf128db
 size 3808389664
--- a/AceInstruct-7B.i1-Q3_K_S.gguf
+++ b/AceInstruct-7B.i1-Q3_K_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:6563fb4d3a3810760506fe36a3aa781ae8adbba13a3f7de8b6be226edc95629a
 size 3492366880
--- a/AceInstruct-7B.i1-Q4_0.gguf
+++ b/AceInstruct-7B.i1-Q4_0.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:42cdfbd62831974f1541014cce6e2cabd63f33100a31deeafd0aa12e23724805
 size 4444119584
--- a/AceInstruct-7B.i1-Q4_1.gguf
+++ b/AceInstruct-7B.i1-Q4_1.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:1ea51bdf8981a7f413c916cc622adf123e11ef2b137cb7e4092f430d1b27ba1b
 size 4873282080
--- a/AceInstruct-7B.i1-Q4_K_M.gguf
+++ b/AceInstruct-7B.i1-Q4_K_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:dbc57cfa18e5722ccb29624ff7d02680aff570e160db8839facc5b37b0741895
 size 4683072032
--- a/AceInstruct-7B.i1-Q4_K_S.gguf
+++ b/AceInstruct-7B.i1-Q4_K_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:4a7fd501ba6ffd6a7cd359ef3c0bcff79bb0ee29a98a86fc25632a1b8b4a5238
 size 4457767456
--- a/AceInstruct-7B.i1-Q5_K_M.gguf
+++ b/AceInstruct-7B.i1-Q5_K_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:72a8997943d0ea0018e6b94e7c889f3dada9ad4bbc9d20d332ed1d9563f95665
 size 5444829728
--- a/AceInstruct-7B.i1-Q5_K_S.gguf
+++ b/AceInstruct-7B.i1-Q5_K_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:6b946f099d4db117b3d7b58552c828f38fcc3389b460deb47d5997c3da46f51a
 size 5315174944
--- a/AceInstruct-7B.i1-Q6_K.gguf
+++ b/AceInstruct-7B.i1-Q6_K.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:2fc9dc3dfe46a16a6f8544ea5527a949f1087ea617255adeb81c5ded60cc9eb1
 size 6254197280
--- a/README.md
+++ b/README.md
@@ -0,0 +1,84 @@
 ---
 base_model: nvidia/AceInstruct-7B
 language:
 - en
 library_name: transformers
 license: cc-by-nc-4.0
 quantized_by: mradermacher
 tags:
 - nvidia
 - AceInstruct
 - code
 - math
 - general_domain
 - instruct_model
 - pytorch
 ---
 ## About
 <!-- ### quantize_version: 2 -->
 <!-- ### output_tensor_quantised: 1 -->
 <!-- ### convert_type: hf -->
 <!-- ### vocab_type:  -->
 <!-- ### tags: nicoboss -->
 weighted/imatrix quants of https://huggingface.co/nvidia/AceInstruct-7B
 <!-- provided-files -->
 static quants are available at https://huggingface.co/mradermacher/AceInstruct-7B-GGUF
 ## Usage
 If you are unsure how to use GGUF files, refer to one of [TheBloke's
 READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
 more details, including how to concatenate multi-part files.
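For the multi-part case mentioned above, joining usually amounts to byte-wise concatenation in part order. Below is a minimal Python sketch under two assumptions: the parts are plain byte splits, and they are named with a `.partNofM` suffix. Both the naming scheme and the helper name `concat_parts` are hypothetical illustrations, and every file in this repository is single-part, so none of them needs this step.

```python
import re
import shutil
from pathlib import Path


def concat_parts(first_part: str, output: str) -> int:
    """Join split files byte-for-byte in numeric part order.

    Assumes names such as 'model.gguf.part1of2' (a hypothetical scheme).
    Returns the number of bytes written to `output`.
    """
    first = Path(first_part)
    match = re.fullmatch(r"(.+)\.part(\d+)of(\d+)", first.name)
    if match is None:
        raise ValueError(f"not a recognised part name: {first.name}")
    stem, total = match.group(1), int(match.group(3))

    written = 0
    with open(output, "wb") as out:
        # Iterate numerically so that part10 does not sort before part2.
        for index in range(1, total + 1):
            part = first.with_name(f"{stem}.part{index}of{total}")
            with open(part, "rb") as src:
                shutil.copyfileobj(src, out)
            written += part.stat().st_size
    return written
```

Calling `concat_parts("model.gguf.part1of2", "model.gguf")` would write the joined file next to the parts. Comparing the returned byte count with the expected total size is a cheap sanity check.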
 ## Provided Quants
 (sorted by size, not necessarily by quality; IQ quants are often preferable to similarly sized non-IQ quants)
 | Link | Type | Size/GB | Notes |
 |:-----|:-----|--------:|:------|
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ1_S.gguf) | i1-IQ1_S | 2.0 | for the desperate |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ1_M.gguf) | i1-IQ1_M | 2.1 | mostly desperate |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.4 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.6 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_S.gguf) | i1-IQ2_S | 2.7 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ2_M.gguf) | i1-IQ2_M | 2.9 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.9 | very low quality |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q2_K.gguf) | i1-Q2_K | 3.1 | IQ3_XXS probably better |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.2 | lower quality |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.4 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.6 | IQ3_XS probably better |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_S.gguf) | i1-IQ3_S | 3.6 | beats Q3_K* |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ3_M.gguf) | i1-IQ3_M | 3.7 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.9 | IQ3_S probably better |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.2 | IQ3_M probably better |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.3 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.5 | prefer IQ4_XS |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_0.gguf) | i1-Q4_0 | 4.5 | fast, low quality |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.6 | optimal size/speed/quality |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.8 | fast, recommended |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q4_1.gguf) | i1-Q4_1 | 5.0 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.4 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.5 |  |
 | [GGUF](https://huggingface.co/mradermacher/AceInstruct-7B-i1-GGUF/resolve/main/AceInstruct-7B.i1-Q6_K.gguf) | i1-Q6_K | 6.4 | practically like static Q6_K |
 Here is a handy graph by ikawrakow comparing some lower-quality quant
 types (lower is better):
 ![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
 And here are Artefact2's thoughts on the matter:
 https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
 ## FAQ / Model Request
 See https://huggingface.co/mradermacher/model_requests for answers to
 questions you might have, or to request quants of another model.
 ## Thanks
 I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
 me use its servers and providing upgrades to my workstation to enable
 this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
 <!-- end -->
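Each `.gguf` entry in this commit is a Git LFS pointer that records a `sha256` oid and a byte size rather than the model data itself. A downloaded file can be checked against those two values. The following is a minimal Python sketch, not part of the original repository: `parse_lfs_pointer` and `verify_file` are hypothetical helper names, and the sketch assumes the three-line pointer layout shown in the hunks above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return {
        "version": fields["version"],
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and SHA-256 digest match the pointer."""
    file = Path(path)
    # Cheap check first: a size mismatch avoids hashing several gigabytes.
    if file.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with open(file, "rb") as handle:
        # Read in chunks so large GGUF files are never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, parsing the pointer for `AceInstruct-7B.i1-IQ1_S.gguf` yields a size of 1903665696 bytes. `verify_file` would then confirm that a local copy has that size and the digest beginning `11a0c8e6`.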