commit 49b256d34df975a72bb04acba7818ab9546710ff
Author: ModelHub XC
Date:   Fri Jun 19 14:10:12 2026 +0800

    Initialize project; model provided by the ModelHub XC community
    Model: prithivMLmods/SmolLM2-Rethink-360M-GGUF
    Source: Original Platform

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 0000000..53d7257
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1,47 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bin.* filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zstandard filter=lfs diff=lfs merge=lfs -text
+*.tfevents* filter=lfs diff=lfs merge=lfs -text
+*.db* filter=lfs diff=lfs merge=lfs -text
+*.ark* filter=lfs diff=lfs merge=lfs -text
+**/*ckpt*data* filter=lfs diff=lfs merge=lfs -text
+**/*ckpt*.meta filter=lfs diff=lfs merge=lfs -text
+**/*ckpt*.index filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.gguf* filter=lfs diff=lfs merge=lfs -text
+*.ggml filter=lfs diff=lfs merge=lfs -text
+*.llamafile* filter=lfs diff=lfs merge=lfs -text
+*.pt2 filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
\ No newline at end of file
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..7b29252
--- /dev/null
+++ b/README.md
@@ -0,0 +1,45 @@
+---
+license: apache-2.0
+language:
+- en
+base_model:
+- prithivMLmods/SmolLM2-Rethink-360M
+pipeline_tag: text-generation
+library_name: transformers
+tags:
+- text-generation-inference
+- trl
+---
+# **SmolLM2-Rethink-360M-GGUF**
+
+> SmolLM2-Rethink-360M is an experimental lightweight reasoning model trained on the Celestia3-DeepSeek-R1-0528 dataset. Built on top of the SmolLM2-135M-Instruct architecture and scaled to 360M parameters, it is designed to enhance lightweight reasoning, logical deduction, and structured response generation—all while maintaining efficiency for resource-constrained environments.
+
+## Model Files
+
+| File Name | Size | Type | Description |
+|-----------|------|------|-------------|
+| SmolLM2-Rethink-360M.Q2_K.gguf | 219 MB | Model | Q2_K quantized model (smallest) |
+| SmolLM2-Rethink-360M.Q3_K_S.gguf | 219 MB | Model | Q3_K_S quantized model |
+| SmolLM2-Rethink-360M.Q3_K_M.gguf | 235 MB | Model | Q3_K_M quantized model |
+| SmolLM2-Rethink-360M.Q3_K_L.gguf | 246 MB | Model | Q3_K_L quantized model |
+| SmolLM2-Rethink-360M.Q4_K_S.gguf | 260 MB | Model | Q4_K_S quantized model |
+| SmolLM2-Rethink-360M.Q4_K_M.gguf | 271 MB | Model | Q4_K_M quantized model |
+| SmolLM2-Rethink-360M.Q5_K_S.gguf | 283 MB | Model | Q5_K_S quantized model |
+| SmolLM2-Rethink-360M.Q5_K_M.gguf | 290 MB | Model | Q5_K_M quantized model |
+| SmolLM2-Rethink-360M.Q6_K.gguf | 367 MB | Model | Q6_K quantized model |
+| SmolLM2-Rethink-360M.Q8_0.gguf | 386 MB | Model | Q8_0 quantized model |
+| SmolLM2-Rethink-360M.BF16.gguf | 726 MB | Model | BF16 precision model |
+| SmolLM2-Rethink-360M.F16.gguf | 726 MB | Model | F16 precision model |
+| SmolLM2-Rethink-360M.F32.gguf | 1.45 GB | Model | F32 full precision model (largest) |
+| .gitattributes | 2.4 kB | Config | Git LFS configuration |
+| config.json | 29 Bytes | Config | Model configuration |
+| README.md | 31 Bytes | Documentation | Repository documentation |
+
+## Quants Usage
+
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
\ No newline at end of file
diff --git a/SmolLM2-Rethink-360M.BF16.gguf b/SmolLM2-Rethink-360M.BF16.gguf
new file mode 100644
index 0000000..741e3e7
--- /dev/null
+++ b/SmolLM2-Rethink-360M.BF16.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3fe3247a08155dda9826542d7eebec4999ddc5ee8576871759cf445e3880a0ae
+size 725553440
diff --git a/SmolLM2-Rethink-360M.F16.gguf b/SmolLM2-Rethink-360M.F16.gguf
new file mode 100644
index 0000000..5f4f97d
--- /dev/null
+++ b/SmolLM2-Rethink-360M.F16.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5dc70c795eeaccd0444086ca5a1d257acb250b1d9bea409dab996a45f95531de
+size 725553440
diff --git a/SmolLM2-Rethink-360M.F32.gguf b/SmolLM2-Rethink-360M.F32.gguf
new file mode 100644
index 0000000..a878439
--- /dev/null
+++ b/SmolLM2-Rethink-360M.F32.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:668b64a6d54033cc514305bfddf9c129ffe49e289485e32b5af46bcbe1ef0d1d
+size 1449070880
diff --git a/SmolLM2-Rethink-360M.Q2_K.gguf b/SmolLM2-Rethink-360M.Q2_K.gguf
new file mode 100644
index 0000000..92846d2
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q2_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:71a43dae31d8d4aaa28439567f62a1e256fcfb013a84f03d632cb003def6c24d
+size 218673440
diff --git a/SmolLM2-Rethink-360M.Q3_K_L.gguf b/SmolLM2-Rethink-360M.Q3_K_L.gguf
new file mode 100644
index 0000000..caca45d
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q3_K_L.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0de9c29fc11b855b4c55e3f34f045a560df04a16b05af325cfde85f7f6408005
+size 246321440
diff --git a/SmolLM2-Rethink-360M.Q3_K_M.gguf b/SmolLM2-Rethink-360M.Q3_K_M.gguf
new file mode 100644
index 0000000..820699c
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q3_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e7bccc60921c493c7ef596a047463e949d27c88445847b60336b2a4b3e1891df
+size 234686240
diff --git a/SmolLM2-Rethink-360M.Q3_K_S.gguf b/SmolLM2-Rethink-360M.Q3_K_S.gguf
new file mode 100644
index 0000000..b112eb0
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q3_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c9e733653c2df6966556a663093333b83d7e20bea59d9969a7f53844024bc8b3
+size 218673440
diff --git a/SmolLM2-Rethink-360M.Q4_K_M.gguf b/SmolLM2-Rethink-360M.Q4_K_M.gguf
new file mode 100644
index 0000000..14d9e79
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0a73603bc39b3756fec8959446a6be20db5fdc86b9db25279eef7eb2b0e92ba2
+size 270590240
diff --git a/SmolLM2-Rethink-360M.Q4_K_S.gguf b/SmolLM2-Rethink-360M.Q4_K_S.gguf
new file mode 100644
index 0000000..ee78850
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q4_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:86644dd96afa6361aae4a8d134495fb306252ae5832ef49b751329624f9e1b3e
+size 259915040
diff --git a/SmolLM2-Rethink-360M.Q5_K_M.gguf b/SmolLM2-Rethink-360M.Q5_K_M.gguf
new file mode 100644
index 0000000..3481871
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q5_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a0388e5ede30e68f0601a763fa55e78a45d79193cb35171fb5898cdae3189e2a
+size 289943840
diff --git a/SmolLM2-Rethink-360M.Q5_K_S.gguf b/SmolLM2-Rethink-360M.Q5_K_S.gguf
new file mode 100644
index 0000000..fcb8213
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q5_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b714acf0754f9a02fce2df317a195b31d8743bda10572358037d28bf126aa0e4
+size 283185440
diff --git a/SmolLM2-Rethink-360M.Q6_K.gguf b/SmolLM2-Rethink-360M.Q6_K.gguf
new file mode 100644
index 0000000..999c8d1
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q6_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:582a839e1edffb8396980b1f6d2a55b16a6c2a7910f96a69fb0dd83f256ac113
+size 367358240
diff --git a/SmolLM2-Rethink-360M.Q8_0.gguf b/SmolLM2-Rethink-360M.Q8_0.gguf
new file mode 100644
index 0000000..e37a870
--- /dev/null
+++ b/SmolLM2-Rethink-360M.Q8_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:00fec9bb3ab3dcd254f0a127a6ee71e1733e9cdafad68d55a8a6fca74add6755
+size 386404640
diff --git a/config.json b/config.json
new file mode 100644
index 0000000..a4ba21b
--- /dev/null
+++ b/config.json
@@ -0,0 +1,3 @@
+{
+  "model_type": "llama"
+}
\ No newline at end of file
diff --git a/configuration.json b/configuration.json
new file mode 100644
index 0000000..bbeeda1
--- /dev/null
+++ b/configuration.json
@@ -0,0 +1 @@
+{"framework": "pytorch", "task": "text-generation", "allow_remote": true}
\ No newline at end of file
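
Note on the `.gguf` entries: each one is a three-line Git LFS pointer (`version`, `oid`, `size`), not the model weights themselves. The sketch below shows one way a downloaded file could be checked against its pointer. The helper names `parse_lfs_pointer` and `verify_file` are hypothetical and not part of this repository or of any Git LFS tooling; the pointer text is copied from the Q2_K hunk of this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's byte size and digest both match the pointer."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False  # cheap size check before hashing
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in chunks so large model files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer content taken from the Q2_K hunk in this commit.
Q2_K_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:71a43dae31d8d4aaa28439567f62a1e256fcfb013a84f03d632cb003def6c24d
size 218673440
"""

pointer = parse_lfs_pointer(Q2_K_POINTER)
print(f"{pointer['size'] / 1e6:.0f} MB")  # prints "219 MB", matching the README table
```

Assuming the Q2_K file has been fetched into the working directory, `verify_file("SmolLM2-Rethink-360M.Q2_K.gguf", pointer)` would report whether the download matches the recorded size and SHA-256 digest.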