Files
Viper-Coder-v0.1/README.md
ModelHub XC a4c18fccb9 初始化项目,由ModelHub XC社区提供模型
Model: prithivMLmods/Viper-Coder-v0.1
Source: Original Platform
2026-06-06 21:42:14 +08:00

7.4 KiB

license, language, base_model, pipeline_tag, library_name, tags, model-index
license language base_model pipeline_tag library_name tags model-index
apache-2.0
en
prithivMLmods/Jolt-v0.1
text-generation transformers
qwen-optimized-coder
viper🐍
coder
text-generation-inference
name results
Viper-Coder-v0.1
task dataset metrics source
type name
text-generation Text Generation
name type split args
IFEval (0-Shot) wis-k/instruction-following-eval train
num_few_shot
0
type value name
inst_level_strict_acc and prompt_level_strict_acc 55.21 averaged accuracy
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=prithivMLmods%2FViper-Coder-v0.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
BBH (3-Shot) SaylorTwift/bbh test
num_few_shot
3
type value name
acc_norm 44.63 normalized accuracy
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=prithivMLmods%2FViper-Coder-v0.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
MATH Lvl 5 (4-Shot) lighteval/MATH-Hard test
num_few_shot
4
type value name
exact_match 31.87 exact match
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=prithivMLmods%2FViper-Coder-v0.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
GPQA (0-shot) Idavidrein/gpqa train
num_few_shot
0
type value name
acc_norm 13.87 acc_norm
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=prithivMLmods%2FViper-Coder-v0.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type args
MuSR (0-shot) TAUR-Lab/MuSR
num_few_shot
0
type value name
acc_norm 13.03 acc_norm
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=prithivMLmods%2FViper-Coder-v0.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
MMLU-PRO (5-shot) TIGER-Lab/MMLU-Pro main test
num_few_shot
5
type value name
acc 32.53 accuracy
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=prithivMLmods%2FViper-Coder-v0.1 Open LLM Leaderboard

coderx.png

Viper-Coder-v0.1

Viper-Coder-v0.1 is based on the Qwen 2.5 14B modality architecture, designed to enhance the reasoning capabilities of 14B-parameter models. It has been fine-tuned on a synthetic dataset based on the latest coding logits and CoT datasets, further optimizing its chain-of-thought (CoT) reasoning and logical problem-solving abilities. The model demonstrates significant improvements in context understanding, structured data processing, and long-context comprehension, making it ideal for complex reasoning tasks, instruction-following, and text generation.

Key Improvements

  1. Enhanced Knowledge and Expertise: Improved mathematical reasoning, coding proficiency, and structured data processing.
  2. Fine-Tuned Instruction Following: Optimized for precise responses, structured outputs (e.g., JSON), and generating long texts (8K+ tokens).
  3. Greater Adaptability: Better role-playing capabilities and resilience to diverse system prompts.
  4. Long-Context Support: Handles up to 128K tokens and generates up to 8K tokens per output.
  5. Multilingual Proficiency: Supports over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, and more.

Quickstart with Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "prithivMLmods/Viper-Coder-v0.1"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language models."
messages = [
    {"role": "system", "content": "You are an advanced AI assistant with expert-level reasoning and knowledge."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=512
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)

Intended Use

  • Advanced Reasoning & Context Understanding: Designed for logical deduction, multi-step problem-solving, and complex knowledge-based tasks.
  • Mathematical & Scientific Problem-Solving: Enhanced capabilities for calculations, theorem proving, and scientific queries.
  • Code Generation & Debugging: Generates and optimizes code across multiple programming languages.
  • Structured Data Analysis: Processes tables, JSON, and structured outputs, making it ideal for data-centric tasks.
  • Multilingual Applications: High proficiency in over 29 languages, enabling global-scale applications.
  • Extended Content Generation: Supports detailed document writing, research reports, and instructional guides.

Limitations

  1. High Computational Requirements: Due to its 14B parameters and 128K context support, it requires powerful GPUs or TPUs for efficient inference.
  2. Language-Specific Variability: Performance may vary across supported languages, especially for low-resource languages.
  3. Potential Error Accumulation: Long-text generation can sometimes introduce inconsistencies over extended outputs.
  4. Limited Real-World Awareness: Knowledge is restricted to training data and may not reflect recent world events.
  5. Prompt Sensitivity: Outputs can depend on the specificity and clarity of the input prompt.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here! Summarized results can be found here!

Metric Value (%)
Average 31.86
IFEval (0-Shot) 55.21
BBH (3-Shot) 44.63
MATH Lvl 5 (4-Shot) 31.87
GPQA (0-shot) 13.87
MuSR (0-shot) 13.03
MMLU-PRO (5-shot) 32.53