初始化项目,由ModelHub XC社区提供模型
Model: cortexso/deepseek-r1-distill-qwen-1.5b Source: Original Platform
This commit is contained in:
45
.gitattributes
vendored
Normal file
45
.gitattributes
vendored
Normal file
@@ -0,0 +1,45 @@
|
|||||||
|
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.model filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||||
|
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q2_k.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q3_k_l.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q3_k_s.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q4_k_s.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q5_k_s.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
deepseek-r1-distill-qwen-1.5b-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
39
README.md
Normal file
39
README.md
Normal file
@@ -0,0 +1,39 @@
|
|||||||
|
---
|
||||||
|
license: mit
|
||||||
|
pipeline_tag: text-generation
|
||||||
|
tags:
|
||||||
|
- cortex.cpp
|
||||||
|
---
|
||||||
|
## Overview
|
||||||
|
|
||||||
|
**DeepSeek** developed and released the [DeepSeek R1 Distill Qwen 1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) model, a distilled version of the Qwen 1.5B language model. It is fine-tuned for high-performance text generation and optimized for dialogue and information-seeking tasks. This model achieves a balance of efficiency and accuracy while maintaining a smaller footprint compared to the original Qwen 1.5B.
|
||||||
|
|
||||||
|
The model is designed for applications in customer support, conversational AI, and research, prioritizing both helpfulness and safety.
|
||||||
|
|
||||||
|
## Variants
|
||||||
|
|
||||||
|
| No | Variant | Cortex CLI command |
|
||||||
|
| --- | --- | --- |
|
||||||
|
| 1 | [Deepseek-r1-distill-qwen-1.5b-1.5b](https://huggingface.co/cortexso/deepseek-r1-distill-qwen-1.5b/tree/1.5b) | `cortex run deepseek-r1-distill-qwen-1.5b:1.5b` |
|
||||||
|
|
||||||
|
|
||||||
|
## Use it with Jan (UI)
|
||||||
|
|
||||||
|
1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
|
||||||
|
2. Use in Jan model Hub:
|
||||||
|
```bash
|
||||||
|
cortexso/deepseek-r1-distill-qwen-1.5b
|
||||||
|
```
|
||||||
|
## Use it with Cortex (CLI)
|
||||||
|
|
||||||
|
1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
|
||||||
|
2. Run the model with command:
|
||||||
|
```bash
|
||||||
|
cortex run deepseek-r1-distill-qwen-1.5b
|
||||||
|
```
|
||||||
|
## Credits
|
||||||
|
|
||||||
|
- **Author:** DeepSeek
|
||||||
|
- **Converter:** [Homebrew](https://www.homebrew.ltd/)
|
||||||
|
- **Original License:** [License](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B#7-license)
|
||||||
|
- **Papers:** [DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning](https://arxiv.org/html/2501.12948v1)
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q2_k.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q2_k.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:d0158a03fb9d3d9eb701b02b2a3d6006fbf2c42ef74c91fc01ca44b34e412d0e
|
||||||
|
size 752879904
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q3_k_l.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q3_k_l.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:65cb42bade66e46ba90bf43a0165883c263de32f0d0a1d67b03c694f3b8bceb5
|
||||||
|
size 980439840
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q3_k_m.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q3_k_m.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:9235d49e507ef970d54784f905a3d5892ae3386ae726ad6840c392ae559cca73
|
||||||
|
size 924455712
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q3_k_s.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q3_k_s.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:5c8a514f11865245b4d80f74d8314d31f1c165105ab9e989e87c59d98d4246d4
|
||||||
|
size 861221664
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q4_k_m.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q4_k_m.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:3c47e9cc3399de9403f11a206ed59dcef2f9f8889eff71ed6f5a92a1b84453f1
|
||||||
|
size 1117320480
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q4_k_s.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q4_k_s.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:59500a4a0625421cb3cca5d0c09a30e5f50b56130d765bad57b682ad9fdde951
|
||||||
|
size 1071584544
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q5_k_m.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q5_k_m.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:6faad9e18a4d5d59852bf58b81e3a7a704b6f0712786ce683d473751474ae967
|
||||||
|
size 1285494048
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q5_k_s.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q5_k_s.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:0bfa62baa1a976bdb236a0e1e970f085cc19ec6c374206865f756fb7ce39367f
|
||||||
|
size 1259173152
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q6_k.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q6_k.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:fa21e354d44b210e30100730676e81ed17d7c8f003307a327e4713e764099d7b
|
||||||
|
size 1464178464
|
||||||
3
deepseek-r1-distill-qwen-1.5b-q8_0.gguf
Normal file
3
deepseek-r1-distill-qwen-1.5b-q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:6529f4b51da8765ecf67d2b94c3749e5f78ea86ae720e6504c0e7b50f384cb7d
|
||||||
|
size 1894531872
|
||||||
4
metadata.yml
Normal file
4
metadata.yml
Normal file
@@ -0,0 +1,4 @@
|
|||||||
|
version: 1
|
||||||
|
name: deepseek-r1-distill-qwen-1.5b
|
||||||
|
default: 1.5b
|
||||||
|
author: "DeepSeek-AI"
|
||||||
49
model.yml
Normal file
49
model.yml
Normal file
@@ -0,0 +1,49 @@
|
|||||||
|
# BEGIN GENERAL GGUF METADATA
|
||||||
|
id: deepseek-r1-distill-qwen-1.5b
|
||||||
|
model: deepseek-r1-distill-qwen-1.5b
|
||||||
|
name: deepseek-r1-distill-qwen-1.5b
|
||||||
|
version: 1
|
||||||
|
# END GENERAL GGUF METADATA
|
||||||
|
|
||||||
|
# BEGIN INFERENCE PARAMETERS
|
||||||
|
# BEGIN REQUIRED
|
||||||
|
stop:
|
||||||
|
- <|im_end|>
|
||||||
|
- "<\uFF5Cend\u2581of\u2581sentence\uFF5C>"
|
||||||
|
# END REQUIRED
|
||||||
|
|
||||||
|
# BEGIN OPTIONAL
|
||||||
|
stream: true
|
||||||
|
top_p: 0.9
|
||||||
|
temperature: 0.7
|
||||||
|
frequency_penalty: 0
|
||||||
|
presence_penalty: 0
|
||||||
|
max_tokens: 4096
|
||||||
|
seed: -1
|
||||||
|
dynatemp_range: 0
|
||||||
|
dynatemp_exponent: 1
|
||||||
|
top_k: 40
|
||||||
|
min_p: 0.05
|
||||||
|
tfs_z: 1
|
||||||
|
typ_p: 1
|
||||||
|
repeat_last_n: 64
|
||||||
|
repeat_penalty: 1
|
||||||
|
mirostat: false
|
||||||
|
mirostat_tau: 5
|
||||||
|
mirostat_eta: 0.100000001
|
||||||
|
penalize_nl: false
|
||||||
|
ignore_eos: false
|
||||||
|
n_probs: 0
|
||||||
|
min_keep: 0
|
||||||
|
# END OPTIONAL
|
||||||
|
# END INFERENCE PARAMETERS
|
||||||
|
|
||||||
|
# BEGIN MODEL LOAD PARAMETERS
|
||||||
|
# BEGIN REQUIRED
|
||||||
|
engine: llama-cpp
|
||||||
|
prompt_template: "|start_of_text|>{system_message}<\uFF5CUser\uFF5C>{prompt}<\uFF5C\
|
||||||
|
Assistant\uFF5C>"
|
||||||
|
ctx_len: 4096
|
||||||
|
ngl: 29
|
||||||
|
# END REQUIRED
|
||||||
|
# END MODEL LOAD PARAMETERS
|
||||||
Reference in New Issue
Block a user