Files
ModelHub XC 3fe59bd4d7 初始化项目,由ModelHub XC社区提供模型
Model: prithivMLmods/Primus-Optima-QwenKV-1.54B
Source: Original Platform
2026-05-24 02:55:13 +08:00

101 lines
3.8 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
library_name: transformers
pipeline_tag: text-generation
license: apache-2.0
language:
- en
- zh
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
tags:
- text-generation-inference
- Code
- Math
- RL
- R1
---
![KV.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/7WWuZljYRluVp5gi3--9a.png)
# **Primus-Optima-QwenKV-1.54B**
> **Primus-Optima-QwenKV-1.54B** is an **experimental chain-of-thought reasoning and code generation model**, built by combining the strengths of two sources:
>
> - **DeepSeek R1 (distilled 1.5B)** for strong math and coding reasoning.
> - **Qwen2.5-0.5B**, fine-tuned with **Process Reward Models (PRM)** to boost structured step-by-step outputs in math and logic.
This hybrid design results in a **bilingual, high-precision model** with enhanced **reasoning depth**, **multi-step clarity**, and **lightweight adaptability** for math and code applications.
## **Key Features**
1. **Chain-of-Thought Reasoning for Math + Code**
Designed to produce human-like intermediate steps in both math and programming problems — useful for education, tutoring, and technical assistants.
2. **Hybrid Architecture (Reasoning + Reward-Guided Fine-Tuning)**
Combines **DeepSeek R1s** distilled capabilities with **Qwen2.5-0.5B**'s reward-optimized reasoning for structured, goal-driven outputs.
3. **Multilingual Capabilities (English + 中文)**
Fluent and accurate in both English and Simplified Chinese, making it suitable for diverse learning and development environments.
4. **Coder Experimental Mode**
Able to solve algorithmic tasks, complete functions, and offer code walkthroughs using the same step-by-step format as it does for math.
5. **Lightweight Yet Capable (1.54B)**
With just 1.54B parameters, it is efficient for local deployments while offering surprisingly strong performance on STEM and programming tasks.
## **Quickstart with Transformers**
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "prithivMLmods/Primus-Optima-QwenKV-1.54B"
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype="auto",
device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)
prompt = "Write a Python function to compute factorial using recursion."
messages = [
{"role": "system", "content": "You are an expert tutor in math and programming, explaining step-by-step."},
{"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
generated_ids = model.generate(
**model_inputs,
max_new_tokens=512
)
generated_ids = [
output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
```
## **Intended Use**
- **Math & Programming Tutors**: Assist students with logic-driven step-by-step explanations.
- **Bilingual STEM Apps**: Ideal for dual-language math or coding environments.
- **Competitive Reasoning Tools**: Suited for reasoning-intensive tasks like Olympiad prep, technical quizzes, and programming challenges.
- **On-Device LLMs**: Lightweight enough for web or embedded applications needing real-time reasoning.
## **Limitations**
1. **Experimental Nature**:
This is a hybrid research model; performance may vary across general or creative domains.
2. **Size Constraints**:
As a 1.54B parameter model, extremely complex reasoning tasks may challenge its capabilities.
3. **Bias & Generalization**:
Inherits biases from both DeepSeek R1 and Qwen2.5. Use caution in high-stakes or sensitive applications.
4. **Prompt Engineering Required**:
Structured prompts with clear questions yield the best results, especially for multi-step problems.