commit 46e9981da68f9a6d8cce8c560c77aa210fa77697 Author: ModelHub XC Date: Wed Apr 22 04:42:41 2026 +0800 Initialize project; model provided by the ModelHub XC community Model: mradermacher/DiamondForce-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..0a963ed --- /dev/null +++ b/.gitattributes @@ -0,0 +1,57 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs 
-text +DiamondForce.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +DiamondForce.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/DiamondForce.i1-IQ1_M.gguf b/DiamondForce.i1-IQ1_M.gguf new file mode 100644 index 0000000..ff26650 --- /dev/null +++ b/DiamondForce.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:15315544e51fbeea5014d37a7a744868eeddfd2ee310751b9d992d11ac521a4a +size 3138609824 diff --git a/DiamondForce.i1-IQ1_S.gguf b/DiamondForce.i1-IQ1_S.gguf new file mode 100644 index 0000000..3975016 --- /dev/null +++ b/DiamondForce.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8192659bb7dbb017d5013d371f7677d45ebe7565435076c2426369b956c54614 +size 2898686624 diff --git a/DiamondForce.i1-IQ2_M.gguf b/DiamondForce.i1-IQ2_M.gguf new 
file mode 100644 index 0000000..f20a4b8 --- /dev/null +++ b/DiamondForce.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ae76d934a4198aa197648fb3d55dcbdfc9a7c2d9f00104af6e1869de8d6fac46 +size 4517579424 diff --git a/DiamondForce.i1-IQ2_S.gguf b/DiamondForce.i1-IQ2_S.gguf new file mode 100644 index 0000000..abd6c46 --- /dev/null +++ b/DiamondForce.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:87d31459b057538d835a855dfd467b5742a470a28703571826227c90076ac1d1 +size 4197681824 diff --git a/DiamondForce.i1-IQ2_XS.gguf b/DiamondForce.i1-IQ2_XS.gguf new file mode 100644 index 0000000..9604d25 --- /dev/null +++ b/DiamondForce.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2c0716ea77c8eb877d4220a3f5309dc0c56d8b738a3170d0aa49c0a522ec8349 +size 3891147424 diff --git a/DiamondForce.i1-IQ2_XXS.gguf b/DiamondForce.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..479bb25 --- /dev/null +++ b/DiamondForce.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b88fd3b00d780376859b37060302288881b8547eeee69b1d58c4dbb4addb539e +size 3538481824 diff --git a/DiamondForce.i1-IQ3_M.gguf b/DiamondForce.i1-IQ3_M.gguf new file mode 100644 index 0000000..d367b8b --- /dev/null +++ b/DiamondForce.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fb971f27c2c784d93cda45be5c77987b1e9c23d8283bee8f4f3751cba011e8b2 +size 5984510624 diff --git a/DiamondForce.i1-IQ3_S.gguf b/DiamondForce.i1-IQ3_S.gguf new file mode 100644 index 0000000..114459a --- /dev/null +++ b/DiamondForce.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:94435223d5e92bb88b973803ccf7842bd392026a5bc075485681b8b264fc189c +size 5658981024 diff --git a/DiamondForce.i1-IQ3_XS.gguf b/DiamondForce.i1-IQ3_XS.gguf new file mode 100644 index 0000000..81a4877 --- /dev/null +++ b/DiamondForce.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ 
+version https://git-lfs.github.com/spec/v1 +oid sha256:71edc3f137d4ff07ff7690795825f9fe709fc7596a37e1af933ee8e3cc4ddfa6 +size 5361611424 diff --git a/DiamondForce.i1-IQ3_XXS.gguf b/DiamondForce.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..155ee37 --- /dev/null +++ b/DiamondForce.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:259f227687a58267aabfb206486fb5760ef45541fe796346ce0e7da8389d4257 +size 4960561824 diff --git a/DiamondForce.i1-IQ4_XS.gguf b/DiamondForce.i1-IQ4_XS.gguf new file mode 100644 index 0000000..33cbe54 --- /dev/null +++ b/DiamondForce.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2f42afdd1efb5ba0809ceebcfa4f98333252ebe6f6b3b572db67df314e48d7d0 +size 6964222624 diff --git a/DiamondForce.i1-Q2_K.gguf b/DiamondForce.i1-Q2_K.gguf new file mode 100644 index 0000000..45a8262 --- /dev/null +++ b/DiamondForce.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c90b9f1fc33e6fb908de26b8a9a4ae5fedf2f75845b495573bed3f80b53b86dc +size 4854270624 diff --git a/DiamondForce.i1-Q3_K_L.gguf b/DiamondForce.i1-Q3_K_L.gguf new file mode 100644 index 0000000..f6010b9 --- /dev/null +++ b/DiamondForce.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a7c7697ed95508f624968d7ddb64cbf647d620dafd0e0f84a799153d393f763f +size 6929560224 diff --git a/DiamondForce.i1-Q3_K_M.gguf b/DiamondForce.i1-Q3_K_M.gguf new file mode 100644 index 0000000..e9752e8 --- /dev/null +++ b/DiamondForce.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:de5db15e169524b50bb6af5597a019e5d55c9f47cf02c8e796f0134f1b3994fc +size 6337770144 diff --git a/DiamondForce.i1-Q3_K_S.gguf b/DiamondForce.i1-Q3_K_S.gguf new file mode 100644 index 0000000..22ffd57 --- /dev/null +++ b/DiamondForce.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid 
sha256:81f0111b2d048fc709b3c15ef1d694809b0e882a417a29902ed438f3f65033da +size 5658981024 diff --git a/DiamondForce.i1-Q4_0.gguf b/DiamondForce.i1-Q4_0.gguf new file mode 100644 index 0000000..d7d3843 --- /dev/null +++ b/DiamondForce.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:20560af48dcd4edf620c42173bc2a9fe531a908270987316fcd472d273498d83 +size 7387953824 diff --git a/DiamondForce.i1-Q4_K_M.gguf b/DiamondForce.i1-Q4_K_M.gguf new file mode 100644 index 0000000..5a1e8bc --- /dev/null +++ b/DiamondForce.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dd179b93f4f34f12b839ba50146942f33a6d8cba3a18c8b9650bce41b6bb2703 +size 7865957024 diff --git a/DiamondForce.i1-Q4_K_S.gguf b/DiamondForce.i1-Q4_K_S.gguf new file mode 100644 index 0000000..26b6da9 --- /dev/null +++ b/DiamondForce.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:96e406089fc20b266f68d9758119cc33cb09ae77338ada3e3171e2b9b92ce263 +size 7423179424 diff --git a/DiamondForce.i1-Q5_K_M.gguf b/DiamondForce.i1-Q5_K_M.gguf new file mode 100644 index 0000000..8c0be13 --- /dev/null +++ b/DiamondForce.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:acdf88815c8de46f9b07fc789c981c08075b7db6b1d2e035dba2baa54b381001 +size 9229925024 diff --git a/DiamondForce.i1-Q5_K_S.gguf b/DiamondForce.i1-Q5_K_S.gguf new file mode 100644 index 0000000..065fd10 --- /dev/null +++ b/DiamondForce.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9e2194bb7543b5a5784583848d1815426fd179dd4cc12f731f4f13b96e7bff19 +size 8972286624 diff --git a/DiamondForce.i1-Q6_K.gguf b/DiamondForce.i1-Q6_K.gguf new file mode 100644 index 0000000..988c805 --- /dev/null +++ b/DiamondForce.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7b95b6828fcc350f64f36cbb612252b25bf1bfc37b8d58123f37f5ec88e357a1 +size 10679141024 diff --git 
a/README.md b/README.md new file mode 100644 index 0000000..9085ab6 --- /dev/null +++ b/README.md @@ -0,0 +1,72 @@ +--- +base_model: sequelbox/Llama2-13B-DiamondForce +language: +- en +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +--- +## About + + + + + +weighted/imatrix quants of https://huggingface.co/sequelbox/Llama2-13B-DiamondForce + + +static quants are available at https://huggingface.co/mradermacher/DiamondForce-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ1_S.gguf) | i1-IQ1_S | 3.0 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ1_M.gguf) | i1-IQ1_M | 3.2 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 3.6 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ2_XS.gguf) | i1-IQ2_XS | 4.0 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ2_S.gguf) | i1-IQ2_S | 4.3 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ2_M.gguf) | i1-IQ2_M | 4.6 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q2_K.gguf) | i1-Q2_K | 5.0 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 5.1 | lower quality 
| +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ3_XS.gguf) | i1-IQ3_XS | 5.5 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ3_S.gguf) | i1-IQ3_S | 5.8 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q3_K_S.gguf) | i1-Q3_K_S | 5.8 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ3_M.gguf) | i1-IQ3_M | 6.1 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q3_K_M.gguf) | i1-Q3_K_M | 6.4 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q3_K_L.gguf) | i1-Q3_K_L | 7.0 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-IQ4_XS.gguf) | i1-IQ4_XS | 7.1 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q4_0.gguf) | i1-Q4_0 | 7.5 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q4_K_S.gguf) | i1-Q4_K_S | 7.5 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q4_K_M.gguf) | i1-Q4_K_M | 8.0 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q5_K_S.gguf) | i1-Q5_K_S | 9.1 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q5_K_M.gguf) | i1-Q5_K_M | 9.3 | | +| [GGUF](https://huggingface.co/mradermacher/DiamondForce-i1-GGUF/resolve/main/DiamondForce.i1-Q6_K.gguf) | i1-Q6_K | 10.8 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + 
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..71226b6 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c08a547e2117a2b1a7b804b75a391a798fa7ac4aeca86eab2cf9d14312234b68 +size 7136327
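
Reviewer note: every binary added in this commit is stored as a Git LFS pointer, which records only a `sha256` oid and a byte size. The sketch below shows one way a downloaded file could be checked against such a pointer. It is a minimal illustration, not part of the commit: the function names `parse_lfs_pointer` and `verify_download` are hypothetical, and the pointer text is assumed to be passed without the leading `+` that the diff adds to each line.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text (spec v1) into (sha256_hex, size_in_bytes).

    Assumes each line has the form "<key> <value>", as in the pointers above.
    """
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return digest, int(fields["size"])


def verify_download(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and hash."""
    digest, size = parse_lfs_pointer(pointer_text)
    # Cheap check first: a size mismatch means the download is incomplete or wrong.
    if os.path.getsize(path) != size:
        return False
    # Hash in chunks so multi-gigabyte GGUF files are never loaded into memory at once.
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == digest
```

For example, the three pointer lines recorded for `imatrix.dat` (the `version` line, `oid sha256:c08a547e…`, and `size 7136327`) could be joined with newlines and passed as `pointer_text` together with the path of the downloaded file.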