Model: rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged Source: Original Platform
language, library_name, pipeline_tag, tags, base_model
| language | library_name | pipeline_tag | tags | base_model | |||||
|---|---|---|---|---|---|---|---|---|---|
|
transformers | text-generation |
|
deepseek-ai/deepseek-llm-7b-chat |
gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Merged model fine-tuned from deepseek-ai/deepseek-llm-7b-chat on GSM8K using GRPO.
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged", torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged")
Description
Model synced from source: rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Languages
Jinja
100%