Initialize project; model provided by the ModelHub XC community

Model: mradermacher/GRaPE-Flash-i1-GGUF
Source: Original Platform
ModelHub XC
2026-05-09 20:15:34 +08:00
commit 870d23c1b3
27 changed files with 220 additions and 0 deletions

60
.gitattributes vendored Normal file

@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
GRaPE-Flash.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d05692d9a8cb19f0096fc7b9745535d53eb21f2723d2a756bef4abd8ccbcbf54
size 1638392896


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a74e9591433d327f727d2419376450556d06c03748a77748e1933dcd2f7b05e8
size 1490543680


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:23e9821af8c21aa416d253e83af03c7c195d1971ada14189c2ad8c8db47329f5
size 2328333376


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bf00b1a41b711d1fe521719c9811f14dd5394b9b20356c6c451308aef2157e37
size 2131201088


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1c367aef5a1715cf731b813e05c67863f7440e3a4658fec702f62eff280d578a
size 2084037696


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a97c6f86781c594153c9fda2e1e78af54c91b752c9cfde1f3ecdde0025ccac73
size 1884808256


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:665073f55325dd09719ed6a710e559c6409a4de5b3688bd09bbee5c56439bd93
size 3076543552


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6e405841059711e45af2b0c1cf127c4290f365513b08fc581454a2e44bc33901
size 3023066176


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ec2e80cd5b60db329c2ae290be2e84b6f5dd5ee8c1c73ae6e394a485587f3ebf
size 2865779776


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:25a75d586129f7ccb83220d5a440a59b5404ca827a21b4164b16df730e6c27c3
size 2689567808


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f80af4643baff1fb5f31f27cad622d84e33c174d8a0132893ee95ef987c1cb65
size 3928038464


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6ffb5ea6be407e6f1c373aa22e26d32d56252f428b4d31bcd35a3f66795ba4fc
size 3715103808

3
GRaPE-Flash.i1-Q2_K.gguf Normal file

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:378456c288854194ea5acf0a5c2fac7635a339ea31aa9845f8fbae21d0529292
size 2562763840


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:628b0ec455e9f99ff59ea7bc912612f7ca6d997ff5a97b9133067d653314142e
size 2393943104


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:39dc36172d04abfaf5a58870568f80d0009c4db8cb5e8c05441a16552c1afcaf
size 3611317312


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e5481f5ccb74213d93a7df06a7d28cda66987cea7901b9aec63be25324c7736a
size 3343930432


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d9fa99cb7a7c87bd10e1fd23cf20366257854ca6643e3670be8334833887a6d3
size 3023066176

3
GRaPE-Flash.i1-Q4_0.gguf Normal file

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bd2e502f513ab74083e4c18dee062faa363f319b0d7b66e7a21d65bf46dc5970
size 3944815680

3
GRaPE-Flash.i1-Q4_1.gguf Normal file

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:573a5ae328de4074eb71fceba2c53a26c81c558c76d16bf59ab8bc923f65485d
size 4353907776


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bf24fef96fb773f6d8875aff639b4a07098b70357e70a92de7fb840708962d89
size 4213513280


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2f46a126f973bb9b43d3b55994f25b240bacae57b5b2b761844edd9bd95be3d3
size 3963690048


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1e71f3fe58f76ae568fbfe83eccfbbef99feef212938e4d2542b5605e6b7fa62
size 4926839872


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ba9fd05c9ca6170909cb3708768cf342072a7149bc013d638cfacc77e55a8e16
size 4779777088

3
GRaPE-Flash.i1-Q6_K.gguf Normal file

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a07a9f1c3f320b988bcaa5e770086be452d9b4a2a7ff469c784c2804cbe94624
size 5684749376

3
GRaPE-Flash.imatrix.gguf Normal file

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:04c12ab375c742197998508e79dc4ee5a72a30ce88949b4824d2824ee3fe9390
size 21658752
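
Each three-line hunk above is a Git LFS pointer: a spec version, a `sha256` object ID, and a byte size. As a minimal sketch of how such a pointer could be used to check a downloaded file, the following Python parses the pointer text and compares size and digest. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers introduced for illustration and are not part of Git LFS or of this repository.

```python
import hashlib
from pathlib import Path

# Pointer text copied from the GRaPE-Flash.imatrix.gguf hunk above.
IMATRIX_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:04c12ab375c742197998508e79dc4ee5a72a30ce88949b4824d2824ee3fe9390
size 21658752
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a pointer file into a dict of fields."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    # Cheap check first: a size mismatch means the hash cannot match.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so large GGUF files are never fully in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, `matches_pointer("GRaPE-Flash.imatrix.gguf", parse_lfs_pointer(IMATRIX_POINTER))` would check a local copy of the imatrix file, assuming it was downloaded under that name.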

85
README.md Normal file

@@ -0,0 +1,85 @@
---
base_model: SL-AI/GRaPE-Flash
language:
- en
library_name: transformers
license: apache-2.0
mradermacher:
  readme_rev: 1
quantized_by: mradermacher
---
## About
<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
<!-- ### quants_skip: -->
<!-- ### skip_mmproj: -->
weighted/imatrix quants of https://huggingface.co/SL-AI/GRaPE-Flash
<!-- provided-files -->
***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#GRaPE-Flash-i1-GGUF).***
static quants are available at https://huggingface.co/mradermacher/GRaPE-Flash-GGUF
## Usage
If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including on how to concatenate multi-part files.
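
As a rough illustration of the concatenation step mentioned above, here is a minimal Python sketch that joins split files in the order given. It assumes the parts were produced by plain byte-wise splitting; the function name `concatenate_parts` and the part file names in the usage note are hypothetical, and none of the files in this repository is actually split.

```python
import shutil
from pathlib import Path


def concatenate_parts(parts, output):
    """Join byte-wise split files into one file, in the order given."""
    output = Path(output)
    with output.open("wb") as dst:
        for part in parts:
            with Path(part).open("rb") as src:
                # Stream in 16 MiB chunks so large parts never sit in memory.
                shutil.copyfileobj(src, dst, length=16 * 1024 * 1024)
    return output
```

A call such as `concatenate_parts(["model.gguf.part1of2", "model.gguf.part2of2"], "model.gguf")` would write the joined file; the order of the list matters, so sort the parts by their part number first.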
## Provided Quants
(sorted by size, not necessarily by quality; IQ-quants are often preferable to similar-sized non-IQ quants)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ1_S.gguf) | i1-IQ1_S | 1.6 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ1_M.gguf) | i1-IQ1_M | 1.7 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.0 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.2 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ2_S.gguf) | i1-IQ2_S | 2.2 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ2_M.gguf) | i1-IQ2_M | 2.4 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.5 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q2_K.gguf) | i1-Q2_K | 2.7 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 2.8 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.0 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ3_S.gguf) | i1-IQ3_S | 3.1 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.1 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ3_M.gguf) | i1-IQ3_M | 3.2 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.4 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q3_K_L.gguf) | i1-Q3_K_L | 3.7 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ4_XS.gguf) | i1-IQ4_XS | 3.8 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.0 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q4_0.gguf) | i1-Q4_0 | 4.0 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.1 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.3 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q4_1.gguf) | i1-Q4_1 | 4.5 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q5_K_S.gguf) | i1-Q5_K_S | 4.9 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.0 | |
| [GGUF](https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main/GRaPE-Flash.i1-Q6_K.gguf) | i1-Q6_K | 5.8 | practically like static Q6_K |

Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):
![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
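
To make the table easier to use from a script, the following Python sketch picks the largest listed quant whose file size fits a given budget and builds its download URL from the link pattern used in the table. The helper names `download_url` and `largest_fitting` are hypothetical, only a few rows of the table are copied in, and "largest file that fits" is just one possible heuristic: file size is neither a quality ranking nor the full memory requirement at run time.

```python
BASE_URL = "https://huggingface.co/mradermacher/GRaPE-Flash-i1-GGUF/resolve/main"

# (type, size in GB) copied from a subset of the rows in the table above.
QUANTS = [
    ("i1-IQ2_M", 2.4),
    ("i1-IQ3_XXS", 2.8),
    ("i1-IQ4_XS", 3.8),
    ("i1-Q4_K_S", 4.1),
    ("i1-Q4_K_M", 4.3),
    ("i1-Q5_K_M", 5.0),
    ("i1-Q6_K", 5.8),
]


def download_url(quant_type):
    """Build the download link using the URL pattern from the table."""
    return f"{BASE_URL}/GRaPE-Flash.{quant_type}.gguf"


def largest_fitting(budget_gb, quants=QUANTS):
    """Return the (type, size) pair with the largest size <= budget_gb, or None."""
    fitting = [quant for quant in quants if quant[1] <= budget_gb]
    if not fitting:
        return None
    return max(fitting, key=lambda quant: quant[1])
```

For instance, `largest_fitting(4.2)` selects `i1-Q4_K_S` from the rows above, and `download_url("i1-Q4_K_S")` gives the matching link. Leave headroom in the budget for the context and runtime overhead of whatever program loads the file.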
## FAQ / Model Request
See https://huggingface.co/mradermacher/model_requests for answers to
common questions, or to request that another model be quantized.
## Thanks
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
<!-- end -->