prithivMLmods/Bootes-Qwen3_Coder-Reasoning

Go to file

ModelHub XC a1d9c8f7fb 初始化项目，由ModelHub XC社区提供模型

Model: prithivMLmods/Bootes-Qwen3_Coder-Reasoning
Source: Original Platform

2026-05-19 12:01:04 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

added_tokens.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

configuration.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

merges.txt

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

model-00001-of-00002.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

model-00002-of-00002.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

vocab.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 12:01:04 +08:00

README.md

license, base_model, datasets, language, pipeline_tag, library_name, tags

license

base_model

datasets

language

pipeline_tag

library_name

Bootes-Qwen3_Coder-Reasoning

Bootes-Qwen3_Coder-Reasoning is a fine-tuned variant of the Qwen3-4B architecture, optimized for high-accuracy code reasoning and structured logical task completion. Trained on the CodeAlpaca_20K dataset and additional curated programming corpora, this model is designed to perform technical coding, reasoning, and instruction-following tasks with lightweight computational requirements.

Note

GGUF : https://huggingface.co/prithivMLmods/Bootes-Qwen3_Coder-Reasoning-Q4_K_M-GGUF

Key Features

Code Reasoning with CodeAlpaca_20K and More Fine-tuned on CodeAlpaca_20K and supplementary high-quality datasets focused on:
- Multi-language programming tasks
- Code explanation, completion, and debugging
- Instruction-following with step-wise execution logic
Cross-Language Code Understanding Handles Python, JavaScript, C++, and more. Ideal for code generation, transformation, bug-fixing, and logic validation.
Structured Output Generation Delivers responses in Markdown, JSON, YAML, and structured code blocks. Optimized for IDE workflows, documentation tools, and reproducible computation notebooks.
Instruction-Tuned for Developer Use Cases Maintains strong fidelity to user prompts, especially multi-turn or step-by-step technical instructions across engineering and data workflows.
Multilingual Reasoning in Technical Domains Capable of technical comprehension and explanation in over 20 human languages, supporting global developer audiences.
Efficient 4B Architecture Based on Qwen3-4B for a performance-efficient inference model that scales well on mid-range GPUs and cloud deployment setups.

Quickstart with Transformers🤗

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "prithivMLmods/Bootes-Qwen3_Coder-Reasoning"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Write a Python function to check whether a number is a palindrome. Explain each step."

messages = [
    {"role": "system", "content": "You are a precise coding and reasoning assistant trained on CodeAlpaca and developer datasets."},
    {"role": "user", "content": prompt}
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=512
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)

Intended Use

Code generation, completion, and explanation
Multi-step algorithmic reasoning
Structured technical document generation (Markdown, JSON, YAML)
Debugging assistance and refactoring suggestions
Technical tutoring and developer assistant workflows
Cross-lingual programming education and translation

Limitations

May underperform on non-code-related creative writing
Limited context window versus larger models
Sensitive to prompt phrasing for ambiguous instructions
Occasionally over-justifies code when brevity is desired

References

Qwen2.5 Technical Report – https://arxiv.org/pdf/2412.15115
CodeAlpaca Dataset – https://github.com/sahil280114/codealpaca
YaRN: Context Window Extension for LLMs – https://arxiv.org/pdf/2309.00071

README.md Unescape Escape

Bootes-Qwen3_Coder-Reasoning

Key Features

Quickstart with Transformers🤗

Intended Use

Limitations

References

README.md