初始化项目,由ModelHub XC社区提供模型
Model: smirki/Gradience-T1-3B-Merged-2000 Source: Original Platform
This commit is contained in:
32
README.md
Normal file
32
README.md
Normal file
@@ -0,0 +1,32 @@
|
||||
---
|
||||
library_name: transformers
|
||||
license: apache-2.0
|
||||
datasets:
|
||||
- Tesslate/Gradient-Reasoning
|
||||
language:
|
||||
- zho
|
||||
- eng
|
||||
- fra
|
||||
- spa
|
||||
- por
|
||||
- deu
|
||||
- ita
|
||||
- rus
|
||||
- jpn
|
||||
- kor
|
||||
- vie
|
||||
- tha
|
||||
- ara
|
||||
base_model:
|
||||
- Qwen/Qwen2.5-3B-Instruct
|
||||
---
|
||||
|
||||
# Model Card for Gradience-3B
|
||||
|
||||
This model is still in preview/beta. We're still working on it! This is just so the community can try out our new "Gradient Reasoning" that intends to break problems down and reason faster.
|
||||
|
||||
|
||||
You can use a system prompt to enable thinking:
|
||||
"First, think step-by-step to reach the solution. Enclose your entire reasoning process within <|begin_of_thought|> and <|end_of_thought|> tags."
|
||||
You can try sampling params:
|
||||
Temp: 0.76, TopP: 0.62, Topk 30-68, Rep: 1.0, minp: 0.05
|
||||
Reference in New Issue
Block a user