Initialize project; model provided by the ModelHub XC community
Model: mradermacher/Apollo2-1.5B-i1-GGUF Source: Original Platform
60
.gitattributes
vendored
Normal file
@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
imatrix.dat filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
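The rules above route matching files through Git LFS. Note that each GGUF file is listed by name rather than through a `*.gguf` wildcard. The following is a rough sketch of how one might check which filenames these rules cover. It assumes Python's `fnmatch` is close enough for the slash-free patterns used here; it does not implement full gitattributes semantics (for example `saved_model/**/*`), and the helper names `lfs_patterns` and `is_lfs_tracked` are made up for illustration.

```python
from fnmatch import fnmatch

# A few rules copied from the .gitattributes above; the rest are omitted for brevity.
GITATTRIBUTES = """\
*.bin filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
imatrix.dat filter=lfs diff=lfs merge=lfs -text
Apollo2-1.5B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
"""


def lfs_patterns(text):
    """Return the patterns whose attribute list contains filter=lfs."""
    patterns = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        pattern, *attrs = line.split()
        if "filter=lfs" in attrs:
            patterns.append(pattern)
    return patterns


def is_lfs_tracked(filename, patterns):
    """Approximate check using shell-style globbing (an assumption, not git's matcher)."""
    return any(fnmatch(filename, pattern) for pattern in patterns)


if __name__ == "__main__":
    patterns = lfs_patterns(GITATTRIBUTES)
    for name in ("Apollo2-1.5B.i1-Q4_K_M.gguf", "imatrix.dat", "README.md"):
        print(name, is_lfs_tracked(name, patterns))
```

For an authoritative answer on a real checkout, `git check-attr filter -- <path>` reports the attribute that git itself resolves.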
3
Apollo2-1.5B.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ab2ad9b7049eb77031e0079b31d0f884d27ca31905bd4a57971376ff11e95ba1
size 541036064
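Each of the `.gguf` entries in this commit is a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes of the real file. As a minimal sketch, assuming you have already downloaded the real file, the pointer can be parsed and used to verify the download. The function names `parse_lfs_pointer` and `matches_pointer` are illustrative, not part of any Git LFS tooling.

```python
import hashlib

# Pointer text copied from Apollo2-1.5B.i1-IQ1_M.gguf in this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ab2ad9b7049eb77031e0079b31d0f884d27ca31905bd4a57971376ff11e95ba1
size 541036064
"""


def parse_lfs_pointer(text):
    """Split a pointer file into its version, hash algorithm, digest and size."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Stream the file at `path` and compare its size and SHA-256 to the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError("this sketch only handles sha256 object IDs")
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER))
```

Reading in chunks keeps memory use flat, which matters for files in the 0.5 to 1.5 GB range listed here.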
3
Apollo2-1.5B.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c3e6132b9f929591e0641106be374802fa04ca4f8e99fbeb0cf88027367737c1
size 513102368
3
Apollo2-1.5B.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:175cf917937859352a7db6ea4a1b47588cd06552173799b6da972f70aba8c66b
size 701331488
3
Apollo2-1.5B.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:72598e246c7bd43c7410e4babc9ab65eb082e30be9fd0aec19e5ff51e4922c85
size 664086560
3
Apollo2-1.5B.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:38f7992f8e2046d0750f129a5e0f671a729436fb3c93fd099adb4d5e871a8f34
size 626901536
3
Apollo2-1.5B.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fe5194eaed7161e0b3fb6a229d561dec908f66e99daafec38eadbb0cb078c03a
size 587592224
3
Apollo2-1.5B.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7d9dda38d1b3e31baeb886d9152bf00a09a5055140e8e6fb75a88bd19d5bb05f
size 876940832
3
Apollo2-1.5B.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8b17f2e9b9bda77dd2602f5ecc444bd3c3266b9519a3d64a208711fa8730c314
size 862683680
3
Apollo2-1.5B.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ebd7fba63a58bd0d6bcc1d9a8b488f251907277ff1383849d48e6016d9b292e0
size 831975968
3
Apollo2-1.5B.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d9aa364dcb94ddfc4233319612839637ce94d8915aece7fc77b79711451becef
size 769069088
3
Apollo2-1.5B.i1-IQ4_NL.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dee84b6440c3e47f250f1c7217a5cb3d183fbebd1dc47103f2311776c5ba20e8
size 1067602976
3
Apollo2-1.5B.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4ca2265f56cb4bc6f8d6900c6a8ece74c86f4eeed73254f6ce9e498395222a3d
size 1019710496
3
Apollo2-1.5B.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:62ef9cf05add0ec1c52ade4dee7ccd5f3e1f50542ec193364f5c66bc2a4dd996
size 752879648
3
Apollo2-1.5B.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dee4916b0821e141f0b90e52fc9cf2070f4a4dba8c5c3262a73d33aae0757745
size 716709920
3
Apollo2-1.5B.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:14fe66a19e740a92e189c1707ad9eb385128e7d190edb6fdc4dd15fa7a7a8819
size 980439584
3
Apollo2-1.5B.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:de8864d8b16a2512d050df80288f36feb2166f4a46d33a80b18973fe7256e618
size 924455456
3
Apollo2-1.5B.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c3958d6bdd5a01b57115176cad0ba419201fd95028cd70b8c6c9e46bfb3e4442
size 861221408
3
Apollo2-1.5B.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:87416bb07125044ef2c4c06f64e20ca5ad282c69a49a96bb8409aa0fcadd7050
size 1068807200
3
Apollo2-1.5B.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:56d4ca1cf4329089d55f6c5d7a837efeefd38d4a327320ab0f4454e30dbd3aa1
size 1162699808
3
Apollo2-1.5B.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:81ee78f51b5f462833209b8000a9a0b20d0c0dbaf1b17cbd8a07232f1c57f8fe
size 1117320224
3
Apollo2-1.5B.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7d351b13dbcc4f667b697c3dae59f4908260a24660c007058d79d0d43b5d9fa5
size 1071584288
3
Apollo2-1.5B.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0db1e1fe80ffc17e3a6040d3bcad3f3794a1ec17bc087667d72708e760e6fe9b
size 1285493792
3
Apollo2-1.5B.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:288a74fcb5877c4973b4bccce5fc47210b04890d9414895e1951aa902d3950fe
size 1259172896
3
Apollo2-1.5B.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bf2acf369f06c25d2864fd9c7980068f0cb85f3e4bfead8e482dddc8df7c8a59
size 1464178208
123
README.md
Normal file
@@ -0,0 +1,123 @@
---
base_model: FreedomIntelligence/Apollo2-1.5B
datasets:
- FreedomIntelligence/ApolloMoEDataset
language:
- ar
- en
- zh
- ko
- ja
- mn
- th
- vi
- lo
- mg
- de
- pt
- es
- fr
- ru
- it
- hr
- gl
- cs
- co
- la
- uk
- bs
- bg
- eo
- sq
- da
- sa
- gn
- sr
- sk
- gd
- lb
- hi
- ku
- mt
- he
- ln
- bm
- sw
- ig
- rw
- ha
library_name: transformers
license: apache-2.0
quantized_by: mradermacher
tags:
- biology
- medical
---
## About

<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
weighted/imatrix quants of https://huggingface.co/FreedomIntelligence/Apollo2-1.5B

<!-- provided-files -->
static quants are available at https://huggingface.co/mradermacher/Apollo2-1.5B-GGUF
## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including on how to concatenate multi-part files.

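None of the files in this commit is split, but for larger models the concatenation step mentioned above amounts to appending the parts byte-for-byte in order. A minimal sketch follows. The function name `concatenate_parts` and the part filenames in the comment are hypothetical, so check the actual naming used by the repository you download from.

```python
import shutil
from pathlib import Path


def concatenate_parts(part_paths, output_path):
    """Join split files by appending them byte-for-byte in the given order."""
    with open(output_path, "wb") as out:
        for part in part_paths:
            with open(part, "rb") as src:
                # copyfileobj streams in blocks, so large parts never sit fully in memory
                shutil.copyfileobj(src, out)
    return Path(output_path)


# Hypothetical usage; these part names are placeholders, not files from this repository:
# concatenate_parts(["model.gguf.part1of2", "model.gguf.part2of2"], "model.gguf")
```

The order of `part_paths` matters: the caller is responsible for passing the parts in sequence.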
## Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ1_S.gguf) | i1-IQ1_S | 0.6 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ1_M.gguf) | i1-IQ1_M | 0.6 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ2_S.gguf) | i1-IQ2_S | 0.8 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ2_M.gguf) | i1-IQ2_M | 0.8 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.8 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q2_K.gguf) | i1-Q2_K | 0.9 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.9 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.9 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q3_K_S.gguf) | i1-Q3_K_S | 1.0 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ3_S.gguf) | i1-IQ3_S | 1.0 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ3_M.gguf) | i1-IQ3_M | 1.0 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q3_K_M.gguf) | i1-Q3_K_M | 1.0 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q3_K_L.gguf) | i1-Q3_K_L | 1.1 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ4_XS.gguf) | i1-IQ4_XS | 1.1 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-IQ4_NL.gguf) | i1-IQ4_NL | 1.2 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q4_0.gguf) | i1-Q4_0 | 1.2 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q4_K_S.gguf) | i1-Q4_K_S | 1.2 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q4_K_M.gguf) | i1-Q4_K_M | 1.2 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q4_1.gguf) | i1-Q4_1 | 1.3 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q5_K_S.gguf) | i1-Q5_K_S | 1.4 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q5_K_M.gguf) | i1-Q5_K_M | 1.4 |  |
| [GGUF](https://huggingface.co/mradermacher/Apollo2-1.5B-i1-GGUF/resolve/main/Apollo2-1.5B.i1-Q6_K.gguf) | i1-Q6_K | 1.6 | practically like static Q6_K |

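As a rough illustration of choosing from the table, the sketch below picks the largest quant that fits a byte budget. The sizes are copied from the Git LFS pointers in this commit (a subset only), and the function name `largest_quant_within` is made up for this example. File size is only a crude proxy: as noted above, the table is sorted by size, not necessarily quality, so treat the result as a starting point and read the Notes column.

```python
# Sizes in bytes, copied from the Git LFS pointers in this commit (subset).
QUANT_SIZES = {
    "i1-IQ1_S": 513102368,
    "i1-IQ3_XXS": 769069088,
    "i1-Q3_K_L": 980439584,
    "i1-IQ4_XS": 1019710496,
    "i1-Q4_K_S": 1071584288,
    "i1-Q4_K_M": 1117320224,
    "i1-Q6_K": 1464178208,
}


def largest_quant_within(budget_bytes, sizes=QUANT_SIZES):
    """Return the name of the biggest file that fits the budget, or None if none fits."""
    fitting = {name: size for name, size in sizes.items() if size <= budget_bytes}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    print(largest_quant_within(1_100_000_000))
```

The budget here covers only the file on disk; runtime memory use also depends on context length and the inference engine, which this sketch does not model.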
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

## FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.

## Thanks

I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.

<!-- end -->
3
imatrix.dat
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7c0033d9ab60cf88d08bbd88149644225d9766f449ead9c2c4695f0c08768202
size 2042201