ModelHub XC 9eeaef4124 Initialize project; model provided by the ModelHub XC community
Model: hyunseoki/verl-math-transfer-7bi-to-3bi-fix03
Source: Original Platform
2026-05-28 14:21:00 +08:00

  • library_name: transformers
  • pipeline_tag: text-generation
  • tags: verl, math, grpo, transfer, qwen2, 3b, 7bi-to-3bi

VERL Math Transfer 7B to 3B fix03

Math transfer experiment trained with verl. This repo groups all exported Hugging Face checkpoints for the 7B-to-3B fix_0_3 configuration.

Layout

  • main: latest exported checkpoint, currently step-130
  • step revisions: step-010, step-020, step-030, step-040, step-050, step-060, step-070, step-080, step-090, step-100, step-110, step-120, step-130
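
The revision names above follow a regular pattern, so they can be generated rather than typed out. The sketch below is illustrative only: the helper names `expected_step_revisions` and `published_revisions` are hypothetical, and it assumes the step revisions are published as branches or tags, which this README does not state explicitly.

```python
from typing import List

REPO_ID = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"


def expected_step_revisions(first: int = 10, last: int = 130, stride: int = 10) -> List[str]:
    """Build the revision names listed in Layout: step-010, step-020, ..., step-130."""
    return [f"step-{step:03d}" for step in range(first, last + 1, stride)]


def published_revisions(repo_id: str = REPO_ID) -> List[str]:
    """Return the branch and tag names found on the Hub (needs network access)."""
    # Imported lazily so the pure helper above works without huggingface_hub installed.
    from huggingface_hub import list_repo_refs

    refs = list_repo_refs(repo_id)
    # Assumption: step revisions may be branches or tags, so both are collected.
    return sorted({ref.name for ref in refs.branches} | {ref.name for ref in refs.tags})
```

Comparing the two lists is one way to confirm that every expected step is actually available before starting a longer evaluation run.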

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM

repo_id = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"
tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
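
Once the model and tokenizer are loaded, generating an answer might look like the following minimal sketch. The prompt wording in `build_prompt` is a hypothetical placeholder, since this README does not document the prompt format used during training, and `generate_answer` is likewise an illustrative helper rather than part of the repo.

```python
def build_prompt(question: str) -> str:
    """Hypothetical prompt format; the README does not specify the one used in training."""
    return f"Solve the following math problem step by step.\n\nProblem: {question}\nSolution:"


def generate_answer(model, tokenizer, question: str, max_new_tokens: int = 256) -> str:
    """Greedy-decode a completion and return only the newly generated text."""
    inputs = tokenizer(build_prompt(question), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # generate() returns prompt tokens followed by new tokens; slice off the prompt.
    prompt_length = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0, prompt_length:], skip_special_tokens=True)
```

With the `model` and `tokenizer` from the snippet above, a call would be `generate_answer(model, tokenizer, "What is 12 * 7?")`. If the checkpoint ships a chat template, `tokenizer.apply_chat_template` may be a better fit than a raw prompt.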

Load a specific checkpoint revision:

from transformers import AutoTokenizer, AutoModelForCausalLM

repo_id = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"
revision = "step-130"
tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
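
Because each training step is its own revision, the same loading pattern can be repeated across checkpoints to compare them. The sketch below assumes a user-supplied `score` callable; `load_revision` and `sweep_revisions` are hypothetical helper names, not something this repo provides.

```python
from typing import Callable, Dict, Iterable

REPO_ID = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"


def load_revision(revision: str):
    """Load the tokenizer and model for one revision (needs network access)."""
    # Imported lazily so sweep_revisions can be exercised with a stub loader.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(REPO_ID, revision=revision, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(REPO_ID, revision=revision, trust_remote_code=True)
    return tokenizer, model


def sweep_revisions(
    revisions: Iterable[str],
    score: Callable,
    load: Callable = load_revision,
) -> Dict[str, float]:
    """Apply score(tokenizer, model) to each revision, one checkpoint at a time."""
    results: Dict[str, float] = {}
    for revision in revisions:
        tokenizer, model = load(revision)
        results[revision] = score(tokenizer, model)
        # Drop references so the checkpoint can be freed before the next load.
        del tokenizer, model
    return results
```

Loading one checkpoint at a time keeps peak memory close to that of a single model, at the cost of repeated downloads on the first run.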

Notes

  • Architecture detected from the exported config: Qwen2ForCausalLM
  • The original base model Hub ID is not encoded in these local checkpoints, so base_model metadata is not set automatically.
  • Checkpoints were exported from verl FSDP shards into Hugging Face safetensors format.
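
The architecture note can be checked against a downloaded checkpoint by reading the `architectures` field of its `config.json`. This is a minimal sketch assuming a local copy of that file; the helper names are hypothetical.

```python
import json
from pathlib import Path
from typing import Any, Dict, List


def architectures_from_config(config: Dict[str, Any]) -> List[str]:
    """Return the architecture class names recorded in a Hugging Face config dict."""
    return list(config.get("architectures") or [])


def architectures_from_file(config_path: str) -> List[str]:
    """Read a local config.json and return its architecture class names."""
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    return architectures_from_config(config)


def is_qwen2_causal_lm(config: Dict[str, Any]) -> bool:
    """True when the config names the architecture stated in the Notes."""
    return "Qwen2ForCausalLM" in architectures_from_config(config)
```

For example, `architectures_from_file("config.json")` on an exported checkpoint should include `Qwen2ForCausalLM` if the note above holds for that revision.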