Initialize project; model provided by the ModelHub XC community

Model: cortexso/deepseek-r1-distill-qwen-1.5b
Source: Original Platform
ModelHub XC
2026-04-21 17:13:38 +08:00
commit 83b7339b4e
14 changed files with 167 additions and 0 deletions

45
.gitattributes vendored Normal file

@@ -0,0 +1,45 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q2_k.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q3_k_l.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q3_k_s.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q4_k_s.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q5_k_s.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
deepseek-r1-distill-qwen-1.5b-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
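
The attribute rules above decide which paths go through Git LFS. As a rough illustration, the sketch below approximates that matching in Python with `fnmatchcase`. It is not git's exact algorithm (git treats `*` and `**` differently around slashes), the pattern list is only a subset of the file above, and the helper name `is_lfs_tracked` is hypothetical.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# A subset of the patterns from the .gitattributes above (assumption: enough to illustrate).
LFS_PATTERNS = [
    "*.bin",
    "*.safetensors",
    "*.tar.*",
    "*tfevents*",
    "saved_model/**/*",
    "deepseek-r1-distill-qwen-1.5b-q4_k_m.gguf",
]


def is_lfs_tracked(path: str, patterns=LFS_PATTERNS) -> bool:
    """Approximate gitattributes matching; not a replacement for `git check-attr`."""
    name = PurePosixPath(path).name
    for pattern in patterns:
        # Patterns without a slash are compared with the file name only;
        # patterns containing a slash are compared with the whole relative path.
        target = path if "/" in pattern else name
        if fnmatchcase(target, pattern):
            return True
    return False


if __name__ == "__main__":
    for candidate in ["deepseek-r1-distill-qwen-1.5b-q4_k_m.gguf", "other.gguf", "README.md"]:
        print(candidate, is_lfs_tracked(candidate))
```

Note that the file lists each `.gguf` by name rather than using a `*.gguf` wildcard, so under these rules a newly added GGUF file would not be routed to LFS until its own line is added.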

39
README.md Normal file

@@ -0,0 +1,39 @@
---
license: mit
pipeline_tag: text-generation
tags:
- cortex.cpp
---
## Overview
**DeepSeek** developed and released the [DeepSeek R1 Distill Qwen 1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) model, a 1.5B-parameter model distilled from DeepSeek-R1: it starts from the Qwen2.5-Math-1.5B base model and is fine-tuned on reasoning samples generated by DeepSeek-R1. It targets text generation with step-by-step reasoning and is also usable for dialogue and information-seeking tasks, with a far smaller footprint than the full DeepSeek-R1 model.
The model is designed for applications in customer support, conversational AI, and research, prioritizing both helpfulness and safety.
## Variants
| No | Variant | Cortex CLI command |
| --- | --- | --- |
| 1 | [Deepseek-r1-distill-qwen-1.5b-1.5b](https://huggingface.co/cortexso/deepseek-r1-distill-qwen-1.5b/tree/1.5b) | `cortex run deepseek-r1-distill-qwen-1.5b:1.5b` |
## Use it with Jan (UI)
1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
2. Search for the model in the Jan model Hub:
```bash
cortexso/deepseek-r1-distill-qwen-1.5b
```
## Use it with Cortex (CLI)
1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
2. Run the model with the command:
```bash
cortex run deepseek-r1-distill-qwen-1.5b
```
## Credits
- **Author:** DeepSeek
- **Converter:** [Homebrew](https://www.homebrew.ltd/)
- **Original License:** [License](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B#7-license)
- **Papers:** [DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning](https://arxiv.org/html/2501.12948v1)

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d0158a03fb9d3d9eb701b02b2a3d6006fbf2c42ef74c91fc01ca44b34e412d0e
size 752879904
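
Each three-line hunk in this part of the commit is a Git LFS pointer: the repository stores only the spec version, a SHA-256 object id, and the byte size, while the GGUF payload lives in LFS storage. The sketch below shows one way to parse such a pointer and check a downloaded file against it. The function names are hypothetical, and the pointer text is copied from the hunk above.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as fh:
        # Read in chunks so multi-gigabyte model files do not need to fit in memory.
        while chunk := fh.read(chunk_size):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:d0158a03fb9d3d9eb701b02b2a3d6006fbf2c42ef74c91fc01ca44b34e412d0e
size 752879904
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER))
```

Calling `matches_pointer("<downloaded file>", parse_lfs_pointer(POINTER))` after a download is a cheap integrity check before loading the model.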

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:65cb42bade66e46ba90bf43a0165883c263de32f0d0a1d67b03c694f3b8bceb5
size 980439840

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9235d49e507ef970d54784f905a3d5892ae3386ae726ad6840c392ae559cca73
size 924455712

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5c8a514f11865245b4d80f74d8314d31f1c165105ab9e989e87c59d98d4246d4
size 861221664

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3c47e9cc3399de9403f11a206ed59dcef2f9f8889eff71ed6f5a92a1b84453f1
size 1117320480

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:59500a4a0625421cb3cca5d0c09a30e5f50b56130d765bad57b682ad9fdde951
size 1071584544

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6faad9e18a4d5d59852bf58b81e3a7a704b6f0712786ce683d473751474ae967
size 1285494048

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0bfa62baa1a976bdb236a0e1e970f085cc19ec6c374206865f756fb7ce39367f
size 1259173152

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fa21e354d44b210e30100730676e81ed17d7c8f003307a327e4713e764099d7b
size 1464178464

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6529f4b51da8765ecf67d2b94c3749e5f78ea86ae720e6504c0e7b50f384cb7d
size 1894531872
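
The `size` fields of the ten pointers give the download cost of each quantized file. The sketch below converts them to GiB and totals them. The diff does not show which pointer belongs to which file name, so the sizes are kept in the order they appear in this commit. That order plausibly matches the alphabetical `.gguf` listing in `.gitattributes`, but this is an assumption.

```python
# Sizes in bytes, in the order the LFS pointers appear in this commit.
POINTER_SIZES = [
    752879904, 980439840, 924455712, 861221664, 1117320480,
    1071584544, 1285494048, 1259173152, 1464178464, 1894531872,
]


def to_gib(num_bytes: int) -> float:
    """Convert a byte count to gibibytes (2**30 bytes)."""
    return num_bytes / 2**30


def summarize(sizes):
    """Return count, total, smallest, and largest size in GiB."""
    return {
        "count": len(sizes),
        "total_gib": to_gib(sum(sizes)),
        "smallest_gib": to_gib(min(sizes)),
        "largest_gib": to_gib(max(sizes)),
    }


if __name__ == "__main__":
    for index, size in enumerate(POINTER_SIZES, start=1):
        print(f"pointer {index}: {to_gib(size):.2f} GiB")
    print(summarize(POINTER_SIZES))
```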

4
metadata.yml Normal file

@@ -0,0 +1,4 @@
version: 1
name: deepseek-r1-distill-qwen-1.5b
default: 1.5b
author: "DeepSeek-AI"

49
model.yml Normal file

@@ -0,0 +1,49 @@
# BEGIN GENERAL GGUF METADATA
id: deepseek-r1-distill-qwen-1.5b
model: deepseek-r1-distill-qwen-1.5b
name: deepseek-r1-distill-qwen-1.5b
version: 1
# END GENERAL GGUF METADATA
# BEGIN INFERENCE PARAMETERS
# BEGIN REQUIRED
stop:
- <|im_end|>
- "<\uFF5Cend\u2581of\u2581sentence\uFF5C>"
# END REQUIRED
# BEGIN OPTIONAL
stream: true
top_p: 0.9
temperature: 0.7
frequency_penalty: 0
presence_penalty: 0
max_tokens: 4096
seed: -1
dynatemp_range: 0
dynatemp_exponent: 1
top_k: 40
min_p: 0.05
tfs_z: 1
typ_p: 1
repeat_last_n: 64
repeat_penalty: 1
mirostat: false
mirostat_tau: 5
mirostat_eta: 0.100000001
penalize_nl: false
ignore_eos: false
n_probs: 0
min_keep: 0
# END OPTIONAL
# END INFERENCE PARAMETERS
# BEGIN MODEL LOAD PARAMETERS
# BEGIN REQUIRED
engine: llama-cpp
prompt_template: "|start_of_text|>{system_message}<\uFF5CUser\uFF5C>{prompt}<\uFF5C\
Assistant\uFF5C>"
ctx_len: 4096
ngl: 29
# END REQUIRED
# END MODEL LOAD PARAMETERS
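
To make the `prompt_template` and `stop` entries above concrete, the sketch below fills the two placeholders and trims generated text at the first stop sequence. The template and stop strings are copied from `model.yml` as written, with the `\uFF5C` (fullwidth vertical bar) and `\u2581` escapes decoded by Python. How the llama-cpp engine actually applies them is not shown in this commit, so the helper names and the substitution logic are assumptions.

```python
import re

# Copied from model.yml; the YAML line continuation joins the two halves without a space.
PROMPT_TEMPLATE = "|start_of_text|>{system_message}<\uFF5CUser\uFF5C>{prompt}<\uFF5CAssistant\uFF5C>"
STOP_SEQUENCES = ["<|im_end|>", "<\uFF5Cend\u2581of\u2581sentence\uFF5C>"]


def render_prompt(system_message: str, prompt: str, template: str = PROMPT_TEMPLATE) -> str:
    """Fill {system_message} and {prompt} in a single pass."""
    values = {"system_message": system_message, "prompt": prompt}
    # A single regex pass avoids re-expanding placeholders that appear inside user text.
    return re.sub(r"\{(system_message|prompt)\}", lambda m: values[m.group(1)], template)


def truncate_at_stop(text: str, stops=STOP_SEQUENCES) -> str:
    """Cut generated text at the earliest stop sequence, if any is present."""
    cut = len(text)
    for stop in stops:
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)
    return text[:cut]


if __name__ == "__main__":
    print(render_prompt("You are a helpful assistant.", "What is 2 + 2?"))
    print(truncate_at_stop("4<\uFF5Cend\u2581of\u2581sentence\uFF5C>ignored"))
```

A regex substitution is used instead of `str.format` so that literal braces in a user prompt, common in code questions, do not raise errors.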