Files
gsm8k-deepseek-llm-7b-chat-…/README.md
ModelHub XC cc7fa7a606 初始化项目,由ModelHub XC社区提供模型
Model: rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Source: Original Platform
2026-04-22 16:34:06 +08:00

700 B

language, library_name, pipeline_tag, tags, base_model
language library_name pipeline_tag tags base_model
en
transformers text-generation
grpo
gsm8k
math
lora
deepseek-ai/deepseek-llm-7b-chat

gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged

Merged model fine-tuned from deepseek-ai/deepseek-llm-7b-chat on GSM8K using GRPO.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged", torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged")