hyunseoki/verl-math-transfer-7bi-to-3bi-fix03

Files

ModelHub XC 9eeaef4124 初始化项目，由ModelHub XC社区提供模型

Model: hyunseoki/verl-math-transfer-7bi-to-3bi-fix03
Source: Original Platform

2026-05-28 14:21:00 +08:00

1.5 KiB

Raw Blame History

library_name, pipeline_tag, tags

library_name

pipeline_tag

VERL Math Transfer 7B to 3B fix03

Math transfer experiment trained with verl. This repo groups all exported Hugging Face checkpoints for the 7B-to-3B fix_0_3 configuration.

Layout

main: latest exported checkpoint, currently step-130
step revisions: step-010, step-020, step-030, step-040, step-050, step-060, step-070, step-080, step-090, step-100, step-110, step-120, step-130

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM

repo_id = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"
tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)

Load a specific checkpoint revision:

from transformers import AutoTokenizer, AutoModelForCausalLM

repo_id = "hyunseoki/verl-math-transfer-7bi-to-3bi-fix03"
revision = "step-130"
tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, revision=revision, trust_remote_code=True)

Notes

Architecture detected from the exported config: Qwen2ForCausalLM
The original base model Hub ID is not encoded in these local checkpoints, so base_model metadata is not set automatically.
Checkpoints were exported from verl FSDP shards into Hugging Face safetensors format.

1.5 KiB Raw Blame History

VERL Math Transfer 7B to 3B fix03

Layout

Usage

Notes

1.5 KiB

Raw Blame History