commit 16d9e2f18d541ecce7983274ca6a35c7178f3bf0 Author: ModelHub XC Date: Sun May 10 12:40:04 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/FollowIR-7B-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..52f644f --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +FollowIR-7B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/FollowIR-7B.i1-IQ1_M.gguf b/FollowIR-7B.i1-IQ1_M.gguf new file mode 100644 index 0000000..73fd010 --- /dev/null +++ b/FollowIR-7B.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a0064744ac5b3b452e6e44929df103231741a1e416f828b2f532acbe9785f05e +size 1754447424 diff --git a/FollowIR-7B.i1-IQ1_S.gguf b/FollowIR-7B.i1-IQ1_S.gguf new file mode 100644 index 0000000..51f7b03 --- /dev/null +++ b/FollowIR-7B.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:69ac8b56d52a5ce7511ad089216840a24c50dfd3e69c7c611f8e15d38b18c3d5 +size 1612103232 diff --git a/FollowIR-7B.i1-IQ2_M.gguf b/FollowIR-7B.i1-IQ2_M.gguf new file mode 100644 index 0000000..89d5010 --- /dev/null +++ b/FollowIR-7B.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3958228f965fd3353249e61829b85f5f25a68d9f9b269289b05cd2e7ddc0e32e +size 2500714048 diff --git a/FollowIR-7B.i1-IQ2_S.gguf b/FollowIR-7B.i1-IQ2_S.gguf new file mode 100644 index 0000000..b87466b --- /dev/null +++ b/FollowIR-7B.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b74a38128a72a0baa27b1f9b2655a159ce737dc50e264a8c13cafba293d15d00 +size 2310921792 diff --git a/FollowIR-7B.i1-IQ2_XS.gguf b/FollowIR-7B.i1-IQ2_XS.gguf new file mode 100644 index 0000000..7630ad2 --- /dev/null +++ b/FollowIR-7B.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4c603c278089fa775e8886854175c374a4f65f1e602a81dde1bb436892453478 +size 2198257216 diff --git a/FollowIR-7B.i1-IQ2_XXS.gguf b/FollowIR-7B.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..0e77b10 --- /dev/null +++ b/FollowIR-7B.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c250d6ca02dc05c922af75bda237c60c57794766d1164f170b6c0506defc7894 +size 1991687744 diff --git a/FollowIR-7B.i1-IQ3_M.gguf b/FollowIR-7B.i1-IQ3_M.gguf new file mode 100644 index 0000000..be34ed7 --- /dev/null +++ b/FollowIR-7B.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3c937587b5d5a51f33e56fa102b5657a16eacbcb731b5524ed0fe04ac7b1e5e3 +size 3284893248 diff --git a/FollowIR-7B.i1-IQ3_S.gguf b/FollowIR-7B.i1-IQ3_S.gguf new file mode 100644 index 0000000..934d1d8 --- /dev/null +++ b/FollowIR-7B.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3e26c997cc5b3e95b4787038f04acbf80f60c2de3a1b6604addb5a4882847058 +size 3182394944 diff --git a/FollowIR-7B.i1-IQ3_XS.gguf b/FollowIR-7B.i1-IQ3_XS.gguf new file mode 100644 index 0000000..5df4306 --- /dev/null +++ b/FollowIR-7B.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8b153d85e75de063ba5828a63fcffa232c94ed4a383a0daebd78036f710ff122 +size 3018817088 diff --git a/FollowIR-7B.i1-IQ3_XXS.gguf b/FollowIR-7B.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..f736ca8 --- /dev/null +++ b/FollowIR-7B.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4a24ca78a899650906d7ea89d9a473c215d3d44cbaa22c4ffe7155f325e28855 +size 2827345472 diff --git a/FollowIR-7B.i1-IQ4_NL.gguf b/FollowIR-7B.i1-IQ4_NL.gguf new file mode 100644 index 0000000..6d14ae9 --- /dev/null +++ b/FollowIR-7B.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:82db61abfcd5cfc78fe93b3b892fcf88cbd4e33058c3c574fb3420c18d9b7c53 +size 4125695552 diff --git a/FollowIR-7B.i1-IQ4_XS.gguf b/FollowIR-7B.i1-IQ4_XS.gguf new file mode 100644 index 0000000..5dfeeee --- /dev/null +++ b/FollowIR-7B.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:88660320e1cdc9b73bf1aaf79b538be4555d755df3f7f2dbdcc5b516e98e85b3 +size 3907690048 diff --git a/FollowIR-7B.i1-Q2_K.gguf b/FollowIR-7B.i1-Q2_K.gguf new file mode 100644 index 0000000..0a684a1 --- /dev/null +++ b/FollowIR-7B.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ccb2473fd7dcd419432d01a724af7d72b5a29a55a7d3b3440c5c6a85f523dc9a +size 2719243840 diff --git a/FollowIR-7B.i1-Q2_K_S.gguf b/FollowIR-7B.i1-Q2_K_S.gguf new file mode 100644 index 0000000..263300a --- /dev/null +++ b/FollowIR-7B.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:38913a350a71542ee89ef9bc7003d7590c2968095a284e3a777d9cd6f2ae56c6 +size 2528927296 diff --git a/FollowIR-7B.i1-Q3_K_L.gguf b/FollowIR-7B.i1-Q3_K_L.gguf new file mode 100644 index 0000000..3adfbde --- /dev/null +++ b/FollowIR-7B.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ca8dd2c49f2263186b6c22495bf4d91697ec76b0563d51dbd0bd074decadca9d +size 3822026304 diff --git a/FollowIR-7B.i1-Q3_K_M.gguf b/FollowIR-7B.i1-Q3_K_M.gguf new file mode 100644 index 0000000..05e33b6 --- /dev/null +++ b/FollowIR-7B.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:147d918c9bd74b265d6bdd7d1db65a7e4d7ae94ca94ef684ed0a97c67ffbe189 +size 3518987840 diff --git a/FollowIR-7B.i1-Q3_K_S.gguf b/FollowIR-7B.i1-Q3_K_S.gguf new file mode 100644 index 0000000..58a9f0b --- /dev/null +++ b/FollowIR-7B.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4fae1a0188e2f5fd5d6466542f9e54398e5e17e1e6ef64550701ae1728801d9a +size 3164569152 diff --git a/FollowIR-7B.i1-Q4_0.gguf b/FollowIR-7B.i1-Q4_0.gguf new file mode 100644 index 0000000..46ca51b --- /dev/null +++ b/FollowIR-7B.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:105310b31fa99519c9b143fb5ddcddbceba902419a253e01dd296443cd2b4a8c +size 4123598400 diff --git a/FollowIR-7B.i1-Q4_1.gguf b/FollowIR-7B.i1-Q4_1.gguf new file mode 100644 index 0000000..0a7d148 --- /dev/null +++ b/FollowIR-7B.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5dbb4b882bdf4bb5cae829e371d592964c053f445191b81e1fb97d062d7b915b +size 4553317952 diff --git a/FollowIR-7B.i1-Q4_K_M.gguf b/FollowIR-7B.i1-Q4_K_M.gguf new file mode 100644 index 0000000..b3b56c9 --- /dev/null +++ b/FollowIR-7B.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dcd25925e9be8bf6f25c761e49144b50e0bf7b7af760920e937329a7937b779e +size 4368440896 diff --git a/FollowIR-7B.i1-Q4_K_S.gguf b/FollowIR-7B.i1-Q4_K_S.gguf new file mode 100644 index 0000000..c447e0b --- /dev/null +++ b/FollowIR-7B.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5989c13d4f485394b24052cc991ab54316980e602146acdd9775673b23133c7d +size 4140375616 diff --git a/FollowIR-7B.i1-Q5_K_M.gguf b/FollowIR-7B.i1-Q5_K_M.gguf new file mode 100644 index 0000000..f56ec53 --- /dev/null +++ b/FollowIR-7B.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1d657e1320a136a5ba8fb1c743fd289b964a33ebbff4ba713ea3f97fcf205bb2 +size 5131411008 diff --git a/FollowIR-7B.i1-Q5_K_S.gguf b/FollowIR-7B.i1-Q5_K_S.gguf new file mode 100644 index 0000000..88c3997 --- /dev/null +++ b/FollowIR-7B.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5c88ee48a85dda9fde112452240634ffd5030925c78819e6e02059ebf444a4b1 +size 4997717568 diff --git a/FollowIR-7B.i1-Q6_K.gguf b/FollowIR-7B.i1-Q6_K.gguf new file mode 100644 index 0000000..1d6cd90 --- /dev/null +++ b/FollowIR-7B.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fba48d8620c084381d48288fd63b0c196f97b96b47e91d84637cd148ebec1296 +size 5942066752 diff --git a/README.md b/README.md new file mode 100644 index 0000000..3be5547 --- /dev/null +++ b/README.md @@ -0,0 +1,83 @@ +--- +base_model: jhu-clsp/FollowIR-7B +datasets: +- jhu-clsp/FollowIR-train +language: +- en +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +tags: +- retrieval +- instructions +- reranking +- mteb +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/jhu-clsp/FollowIR-7B + + +static quants are available at https://huggingface.co/mradermacher/FollowIR-7B-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ1_S.gguf) | i1-IQ1_S | 1.7 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ1_M.gguf) | i1-IQ1_M | 1.9 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.1 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.3 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ2_S.gguf) | i1-IQ2_S | 2.4 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ2_M.gguf) | i1-IQ2_M | 2.6 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.6 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q2_K.gguf) | i1-Q2_K | 2.8 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 2.9 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.1 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.3 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ3_S.gguf) | i1-IQ3_S | 3.3 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ3_M.gguf) | i1-IQ3_M | 3.4 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.6 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q3_K_L.gguf) | i1-Q3_K_L | 3.9 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.0 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q4_0.gguf) | i1-Q4_0 | 4.2 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.2 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.2 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.5 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q4_1.gguf) | i1-Q4_1 | 4.7 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.1 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.2 | | +| [GGUF](https://huggingface.co/mradermacher/FollowIR-7B-i1-GGUF/resolve/main/FollowIR-7B.i1-Q6_K.gguf) | i1-Q6_K | 6.0 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..414e2bf --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9d727e7aa6ceeab8939cbac50f965ab33c325182f97d151d5050002db5d26311 +size 4988157