Model: hyunseoki/verl-math-transfer-7bi-to-3bi-fix03 Source: Original Platform
| library_name | pipeline_tag | tags |
|---|---|---|
| transformers | text-generation | |
VERL Math Transfer 7B to 3B fix03
Math transfer experiment trained with verl. This repo groups all exported Hugging Face checkpoints for the 7B-to-3B fix_0_3 configuration.
Layout
- main: latest exported checkpoint, currently step-130
- step revisions: step-010, step-020, step-030, step-040, step-050, step-060, step-070, step-080, step-090, step-100, step-110, step-120, step-130
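The revision names listed above follow a zero-padded `step-NNN` pattern in increments of 10. As a minimal sketch, the list can be generated rather than typed out; the helper name `step_revisions` and its default arguments are assumptions made for illustration and are not part of this repo.

```python
def step_revisions(first=10, last=130, stride=10):
    """Return revision names such as 'step-010' for each exported step.

    The defaults mirror the revisions listed in this README; the helper
    itself is hypothetical and only builds strings.
    """
    return [f"step-{step:03d}" for step in range(first, last + 1, stride)]


revisions = step_revisions()
print(revisions[0], revisions[-1], len(revisions))
```

Each returned string can be passed as the `revision` argument of `from_pretrained` to load that checkpoint.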
Usage
from transformers import AutoTokenizer, AutoModelForCausalLM
repo_id = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"
tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
Load a specific checkpoint revision:
from transformers import AutoTokenizer, AutoModelForCausalLM
repo_id = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"
revision = "step-130"
tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
Notes
- Architecture detected from the exported config: Qwen2ForCausalLM
- The original base model Hub ID is not encoded in these local checkpoints, so base_model metadata is not set automatically.
- Checkpoints were exported from verl FSDP shards into Hugging Face safetensors format.
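The architecture note above can be checked by hand: a Hugging Face `config.json` normally carries an `architectures` list. The sketch below reads that field from a JSON string. The function name `detect_architecture` and the sample config are assumptions for illustration; the sample is a made-up excerpt, not the actual file from this repo, and this is not necessarily how the export script performed its detection.

```python
import json


def detect_architecture(config_text):
    """Return the first entry of the 'architectures' field, or None if absent.

    Hypothetical helper: it only parses JSON text and does not touch the Hub.
    """
    config = json.loads(config_text)
    architectures = config.get("architectures") or []
    return architectures[0] if architectures else None


# Illustrative excerpt only; a real exported config.json has many more fields.
sample_config = '{"architectures": ["Qwen2ForCausalLM"], "model_type": "qwen2"}'
print(detect_architecture(sample_config))
```

To apply it to a real checkpoint, read the `config.json` of a downloaded revision into a string and pass that in.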