commit 3b532efc74b60290a5204b5551f5a05bd95c494f
Author: ModelHub XC
Date:   Mon Apr 20 09:19:21 2026 +0800

    Initialize project; model provided by the ModelHub XC community

    Model: mradermacher/Murasaki-4B-v0.3-GGUF
    Source: Original Platform

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 0000000..fa92045
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1,47 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
+Murasaki-4B-v0.3.f16.gguf filter=lfs diff=lfs merge=lfs -text
diff --git a/Murasaki-4B-v0.3.IQ4_XS.gguf b/Murasaki-4B-v0.3.IQ4_XS.gguf
new file mode 100644
index 0000000..8451cbb
--- /dev/null
+++ b/Murasaki-4B-v0.3.IQ4_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1663e14af7d3167c2e62e460986bf5219b6051e48df1ebd98b96ea5ffb7baa87
+size 2492949504
diff --git a/Murasaki-4B-v0.3.Q2_K.gguf b/Murasaki-4B-v0.3.Q2_K.gguf
new file mode 100644
index 0000000..83b13c6
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q2_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2fd554486f72ed12e018418d5c2174a8f6d6c35d892f66402586fa6ce776fb38
+size 1797126144
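Each model file in this commit is stored as a Git LFS pointer, which records only a spec version line, a sha256 oid, and a byte size. A minimal verification sketch, assuming one of the quants has already been downloaded locally (the file name and expected values are copied from the Q2_K pointer above):

```python
# Verify a downloaded GGUF against the oid/size in its LFS pointer.
import hashlib
import os

def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream the file through SHA-256 without loading it all into memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(chunk_size), b""):
            h.update(block)
    return h.hexdigest()

path = "Murasaki-4B-v0.3.Q2_K.gguf"  # assumed local download
assert os.path.getsize(path) == 1797126144
assert sha256_of(path) == "2fd554486f72ed12e018418d5c2174a8f6d6c35d892f66402586fa6ce776fb38"
```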
diff --git a/Murasaki-4B-v0.3.Q3_K_L.gguf b/Murasaki-4B-v0.3.Q3_K_L.gguf
new file mode 100644
index 0000000..9ad970b
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q3_K_L.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7eceb6268bf0914dbe22837d6d0ef2518f004bc5f635efd0a1a93046300bcd4c
+size 2406915584
diff --git a/Murasaki-4B-v0.3.Q3_K_M.gguf b/Murasaki-4B-v0.3.Q3_K_M.gguf
new file mode 100644
index 0000000..c7dfbd6
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q3_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:68b16094917b824d7d41e973ffeac02049df2627232ba149bd4f4484438c4cd8
+size 2242747904
diff --git a/Murasaki-4B-v0.3.Q3_K_S.gguf b/Murasaki-4B-v0.3.Q3_K_S.gguf
new file mode 100644
index 0000000..b3d75b2
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q3_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7665a7ba6fddedb37468d34e69a354bafc4108f8b0e37d576f29d92ad8b7fdb8
+size 2054127104
diff --git a/Murasaki-4B-v0.3.Q4_K_M.gguf b/Murasaki-4B-v0.3.Q4_K_M.gguf
new file mode 100644
index 0000000..1a0966b
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:32ba85b69da7141a37888d74bc41d1d34b769bccf24c0b96873c734e9027c49b
+size 2716068864
diff --git a/Murasaki-4B-v0.3.Q4_K_S.gguf b/Murasaki-4B-v0.3.Q4_K_S.gguf
new file mode 100644
index 0000000..74ba06f
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q4_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:619e53c8f70c21abc524c8313e6394fc25f94191eeeb02caf5b18c151ea24045
+size 2602097664
diff --git a/Murasaki-4B-v0.3.Q5_K_M.gguf b/Murasaki-4B-v0.3.Q5_K_M.gguf
new file mode 100644
index 0000000..4051d93
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q5_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9e6ab7965b214bef37ca5e26b1b7b5db7ba516fef1a4c4ba2fff950691de2a3f
+size 3156921344
diff --git a/Murasaki-4B-v0.3.Q5_K_S.gguf b/Murasaki-4B-v0.3.Q5_K_S.gguf
new file mode 100644
index 0000000..38e905e
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q5_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bf33acd5c3283c1d95fc173ba1da97d655f66cc961285ff7be7fe9919cb1fb31
+size 3091119104
diff --git a/Murasaki-4B-v0.3.Q6_K.gguf b/Murasaki-4B-v0.3.Q6_K.gguf
new file mode 100644
index 0000000..8294821
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q6_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0ff6edf9649aefb077d33f4aae95a2ce603ac2b61e0a7c06400dd6c9baccfad5
+size 3625327104
diff --git a/Murasaki-4B-v0.3.Q8_0.gguf b/Murasaki-4B-v0.3.Q8_0.gguf
new file mode 100644
index 0000000..3c0a8b1
--- /dev/null
+++ b/Murasaki-4B-v0.3.Q8_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bba8016655f3629c600359320768abb1f52ebfda3f45dbddb2dd3d604b4b02d2
+size 4693671424
diff --git a/Murasaki-4B-v0.3.f16.gguf b/Murasaki-4B-v0.3.f16.gguf
new file mode 100644
index 0000000..1183274
--- /dev/null
+++ b/Murasaki-4B-v0.3.f16.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9c9cae9994002d6fbdd1e652938868060811d8925ea4864c7a5096e34a44e1f7
+size 8829197824
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..ce2e1d3
--- /dev/null
+++ b/README.md
@@ -0,0 +1,83 @@
+---
+base_model: Murasaki-Project/Murasaki-4B-v0.3
+language:
+- ja
+- zh
+library_name: transformers
+license: cc-by-nc-sa-4.0
+mradermacher:
+  readme_rev: 1
+quantized_by: mradermacher
+tags:
+- translation
+- light-novel
+- galgame
+- anime-subtitles
+- cot
+- system-2
+- text-generation-inference
+- transformers
+- pytorch
+---
+## About
+
+static quants of https://huggingface.co/Murasaki-Project/Murasaki-4B-v0.3
+
+***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Murasaki-4B-v0.3-GGUF).***
+
+weighted/imatrix quants are available at https://huggingface.co/mradermacher/Murasaki-4B-v0.3-i1-GGUF
+
+## Usage
+
+If you are unsure how to use GGUF files, refer to one of [TheBloke's
+READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
+more details, including how to concatenate multi-part files.
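+
+A minimal sketch of loading one of these quants with llama-cpp-python (one
+GGUF runtime among several; the file name, context size, and prompt below
+are illustrative assumptions, not a documented setup for this model):
+
+```python
+# Hypothetical local run of a quant from the table below.
+from llama_cpp import Llama
+
+llm = Llama(
+    model_path="Murasaki-4B-v0.3.Q4_K_M.gguf",  # assumed local download
+    n_ctx=4096,  # context window; adjust for your hardware
+)
+out = llm.create_chat_completion(
+    messages=[{"role": "user", "content": "次の文を中国語に訳してください:おはようございます。"}],
+    max_tokens=128,
+)
+print(out["choices"][0]["message"]["content"])
+```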
+
+## Provided Quants
+
+(sorted by size, not necessarily quality; IQ-quants are often preferable to similar-sized non-IQ quants)
+
+| Link | Type | Size/GB | Notes |
+|:-----|:-----|--------:|:------|
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q2_K.gguf) | Q2_K | 1.9 | |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q3_K_S.gguf) | Q3_K_S | 2.2 | |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q3_K_M.gguf) | Q3_K_M | 2.3 | lower quality |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q3_K_L.gguf) | Q3_K_L | 2.5 | |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.IQ4_XS.gguf) | IQ4_XS | 2.6 | |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q4_K_S.gguf) | Q4_K_S | 2.7 | fast, recommended |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q4_K_M.gguf) | Q4_K_M | 2.8 | fast, recommended |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q5_K_S.gguf) | Q5_K_S | 3.2 | |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q5_K_M.gguf) | Q5_K_M | 3.3 | |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q6_K.gguf) | Q6_K | 3.7 | very good quality |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.Q8_0.gguf) | Q8_0 | 4.8 | fast, best quality |
+| [GGUF](https://huggingface.co/mradermacher/Murasaki-4B-v0.3-GGUF/resolve/main/Murasaki-4B-v0.3.f16.gguf) | f16 | 8.9 | 16 bpw, overkill |
+
+For a programmatic way to fetch any of these files, see the download sketch at the end of this README.
+
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
+
+And here are Artefact2's thoughts on the matter:
+https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
+
+## FAQ / Model Request
+
+See https://huggingface.co/mradermacher/model_requests for some answers to
+questions you might have and/or if you want some other model quantized.
+
+## Thanks
+
+I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
+me use its servers and providing upgrades to my workstation to enable
+this work in my free time.
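+
+A minimal sketch for the programmatic download mentioned above, using
+huggingface_hub (an assumption about tooling; downloading the table links
+directly over HTTP works just as well):
+
+```python
+# Fetch one quant from this repo into the local Hugging Face cache.
+from huggingface_hub import hf_hub_download
+
+path = hf_hub_download(
+    repo_id="mradermacher/Murasaki-4B-v0.3-GGUF",
+    filename="Murasaki-4B-v0.3.Q4_K_M.gguf",  # any filename from the table above
+)
+print(path)  # local path to the downloaded GGUF
+```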