初始化项目,由ModelHub XC社区提供模型

Model: prithivMLmods/Omega-Qwen2.5-Coder-3B
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-24 00:37:12 +08:00
commit 8064bccca6
15 changed files with 246 additions and 0 deletions

36
.gitattributes vendored Normal file
View File

@@ -0,0 +1,36 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
tokenizer.json filter=lfs diff=lfs merge=lfs -text

111
README.md Normal file
View File

@@ -0,0 +1,111 @@
---
license: apache-2.0
tags:
- 'Thinking: Disabled'
- Forge
- code
- mot
- stem
- coder
- trl
datasets:
- prithivMLmods/Open-Omega-Forge-1M
language:
- en
- zh
base_model:
- Qwen/Qwen2.5-Coder-3B-Instruct
pipeline_tag: text-generation
library_name: transformers
---
![9.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/RXFgDRiO1VBm6jJcH2TPr.png)
# **Omega-Qwen2.5-Coder-3B**
> **Omega-Qwen2.5-Coder-3B** is a compact and high-efficiency code-focused model fine-tuned on **Qwen2.5-Coder-3B-Instruct**, using the symbolic-rich **Open-Omega-Forge-1M** dataset. Designed specifically for **hard-coded tasks** and deterministic computation, this model runs in a **"thinking-disabled"** mode—delivering precise, structured outputs with minimal hallucination, making it ideal for rigorous coding workflows and embedded logic applications.
> Thinking: Disabled
> \[!note]
> GGUF: [https://huggingface.co/prithivMLmods/Omega-Qwen2.5-Coder-3B-GGUF](https://huggingface.co/prithivMLmods/Omega-Qwen2.5-Coder-3B-GGUF)
## **Key Features**
1. **Purpose-Built for Hard Coding**
Specially tuned to perform precise, low-level code generation with **minimal reasoning overhead**. Ideal for edge-case algorithms, embedded scripting, and deterministic logic patterns.
2. **Optimized Qwen2.5 Foundation**
Built on **Qwen2.5-Coder-3B-Instruct**, benefiting from its robust token handling, instruction following, and multilingual code representation.
3. **Backed by Open-Omega-Forge-1M Dataset**
Trained on a curated mix of code, math, and logic problems focused on **symbolic clarity** and **STEM coherence**, drawn from sources like OpenCodeReasoning, MathX-5M, OpenMathReasoning, and more.
4. **Thinking Disabled Mode**
The model avoids overgeneralizing or injecting speculative reasoning. It executes tasks **as-is**—perfect for structured prompts, tight constraints, and automation pipelines.
5. **Structured Output Control**
Outputs in **JSON**, **YAML**, **Python**, **Markdown**, and **LaTeX**, tailored for script generation, data serialization, and scientific formatting.
6. **Efficient 3B Deployment**
Lightweight and scalable for **mid-tier GPUs**, **offline dev environments**, or **local inference systems**, while maintaining solid performance on symbolic tasks.
---
## **Quickstart with Transformers**
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "prithivMLmods/Omega-Qwen2.5-Coder-3B"
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype="auto",
device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)
prompt = "Write a Python function to compute factorial iteratively."
messages = [
{"role": "system", "content": "You are a deterministic code generator. No assumptions. No extra explanations."},
{"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
generated_ids = model.generate(
**model_inputs,
max_new_tokens=256
)
generated_ids = [
output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)
```
---
## **Intended Use**
* Embedded logic and deterministic function generation
* Script automation and toolchain integration
* Codegen under fixed constraints or symbolic inputs
* Lightweight STEM applications on edge devices or offline clusters
* Tools where **"no thinking" = better stability**
## **Limitations**
* Not suitable for high-level reasoning or open-ended thought processes
* General chat performance is minimal by design
* Lacks emotional intelligence or creative composition capability
* Assumes user provides **clear, explicit instructions** for best results

25
added_tokens.json Normal file
View File

@@ -0,0 +1,25 @@
{
"</tool_call>": 151658,
"<tool_call>": 151657,
"<|PAD_TOKEN|>": 151665,
"<|box_end|>": 151649,
"<|box_start|>": 151648,
"<|endoftext|>": 151643,
"<|file_sep|>": 151664,
"<|fim_middle|>": 151660,
"<|fim_pad|>": 151662,
"<|fim_prefix|>": 151659,
"<|fim_suffix|>": 151661,
"<|im_end|>": 151645,
"<|im_start|>": 151644,
"<|image_pad|>": 151655,
"<|object_ref_end|>": 151647,
"<|object_ref_start|>": 151646,
"<|quad_end|>": 151651,
"<|quad_start|>": 151650,
"<|repo_name|>": 151663,
"<|video_pad|>": 151656,
"<|vision_end|>": 151653,
"<|vision_pad|>": 151654,
"<|vision_start|>": 151652
}

3
chat_template.jinja Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cd8e9439f0570856fd70470bf8889ebd8b5d1107207f67a5efb46e342330527f
size 2507

3
config.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f1bca299dc2fb89864ee027416294605c5ffe1122eeefc7b2c37eeae5f2d3c93
size 1525

1
configuration.json Normal file
View File

@@ -0,0 +1 @@
{"framework": "pytorch", "task": "text-generation", "allow_remote": true}

15
generation_config.json Normal file
View File

@@ -0,0 +1,15 @@
{
"bos_token_id": 151643,
"do_sample": true,
"eos_token_id": [
151645,
151643
],
"max_length": 32768,
"pad_token_id": 151665,
"repetition_penalty": 1.05,
"temperature": 0.7,
"top_k": 20,
"top_p": 0.8,
"transformers_version": "4.53.2"
}

BIN
merges.txt (Stored with Git LFS) Normal file

Binary file not shown.

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:14ed3243801eb6f5f0616d3cdd0e40063310853f0dda754dc0335c283eb3e672
size 4957560304

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0bb52416c1dcbc354ac7262f42c5104eb83169c46ccc1b7d08ec9ad2c98e6991
size 1214366696

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7b85f79fd7612df2e5d7f03d7f92d65c8967f790a331146ca7f45658823060d3
size 35581

31
special_tokens_map.json Normal file
View File

@@ -0,0 +1,31 @@
{
"additional_special_tokens": [
"<|im_start|>",
"<|im_end|>",
"<|object_ref_start|>",
"<|object_ref_end|>",
"<|box_start|>",
"<|box_end|>",
"<|quad_start|>",
"<|quad_end|>",
"<|vision_start|>",
"<|vision_end|>",
"<|vision_pad|>",
"<|image_pad|>",
"<|video_pad|>"
],
"eos_token": {
"content": "<|im_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"pad_token": {
"content": "<|PAD_TOKEN|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
}
}

3
tokenizer.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fab42efe8d17406525a9154b728cf9e957629a8ed7ce997770efdd71128c6a1a
size 11422086

3
tokenizer_config.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2bcc8b527e0f7b3f5ceb74e13708b004316aa970972a1399b199c6b5f9cd243e
size 4892

BIN
vocab.json (Stored with Git LFS) Normal file

Binary file not shown.