prithivMLmods/Segue-Qwen3_DeepScaleR-Preview

Go to file

ModelHub XC 5db17b96b4 初始化项目，由ModelHub XC社区提供模型

Model: prithivMLmods/Segue-Qwen3_DeepScaleR-Preview
Source: Original Platform

2026-05-26 09:11:16 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

added_tokens.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

configuration.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

merges.txt

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

model-00001-of-00002.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

model-00002-of-00002.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

vocab.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 09:11:16 +08:00

README.md

license, datasets, base_model, language, pipeline_tag, library_name, tags

license

datasets

base_model

language

pipeline_tag

library_name

Segue-Qwen3_DeepScaleR-Preview

Segue-Qwen3_DeepScaleR-Preview is an experimental fine-tuned variant of the Qwen3-4B model architecture. It is trained on the DeepScaleR-Preview dataset—comprising high-quality mathematical reasoning problems—to achieve exceptional performance in symbolic, mathematical, and logical tasks with lightweight computational requirements.

Key Features

Precision Reasoning with DeepScaleR-Preview Dataset Fine-tuned on approximately 40,000 curated math problem-answer pairs sourced from:
- AIME (1984–2023)
- AMC (pre-2023)
- Omni-MATH This enables superior symbolic manipulation and step-by-step logical deduction.
Lightweight Code Understanding Capable of interpreting and generating correct code in Python, C++, and other logic-intensive languages with an emphasis on problem-solving and structured thought.
Structured Output Formatting Outputs are designed to be well-formatted in Markdown, JSON, LaTeX, or tables—ideal for technical documentation, math notebooks, and data workflows.
Instruction-Following Accuracy Strong multi-step instruction adherence, particularly for STEM domains. Ensures continuity, factual correctness, and process transparency in reasoning chains.
Multilingual Capabilities Supports over 20 languages for mathematical and logical reasoning, technical instruction translation, and cross-lingual academic support.
Efficient 4B Architecture Built on the Qwen3-4B base model to balance performance and scalability. Runs efficiently on mid-range GPUs while delivering high-accuracy inference.

Quickstart with Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "prithivMLmods/Segue-Qwen3_DeepScaleR-Preview"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Solve for x: 5(x - 2) = 3x + 4, showing all steps clearly."

messages = [
    {"role": "system", "content": "You are a precise mathematical assistant trained on DeepScaleR-Preview dataset."},
    {"role": "user", "content": prompt}
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=512
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)

Intended Use

Step-by-step mathematical problem solving
Symbolic computation and logic derivation
Code generation and correction in technical environments
Automated LaTeX/Markdown/JSON generation for education and documentation
Academic tutoring and educational assistants
Multilingual reasoning and translation of structured content

Limitations

Less suitable for open-domain conversation or creative writing
Smaller context window compared to large-scale LLMs
May be sensitive to token formatting in edge-case symbolic prompts
Could underperform on intentionally adversarial logic inputs

References

Qwen2.5 Technical Report – https://arxiv.org/pdf/2412.15115
YaRN: Context Window Extension for LLMs – https://arxiv.org/pdf/2309.00071

README.md Unescape Escape

Segue-Qwen3_DeepScaleR-Preview

Key Features

Quickstart with Transformers

Intended Use

Limitations

References

README.md