Files
verl-math-transfer-llama31-…/README.md
ModelHub XC 0b1a40f839 初始化项目,由ModelHub XC社区提供模型
Model: hyunseoki/verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1
Source: Original Platform
2026-05-24 14:49:23 +08:00

52 lines
1.5 KiB
Markdown

---
library_name: transformers
pipeline_tag: text-generation
tags:
- verl
- math
- grpo
- transfer
- llama
- llama-3.1
- llama-3.2
- 8b
- 3b
- pool7to1
---
# VERL Math Transfer Llama 3.1 8B to Llama 3.2 3B pool7to1
Math transfer experiment trained with verl. This repo groups all exported Hugging Face checkpoints for the Llama 3.1 8B to Llama 3.2 3B pool7to1 configuration.
## Layout
- `main`: latest exported checkpoint, currently `step-080`
- step revisions: `step-010, step-020, step-030, step-040, step-050, step-060, step-070, step-080`
## Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
repo_id = "hyunseoki/verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1"
tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
```
Load a specific checkpoint revision:
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
repo_id = "hyunseoki/verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1"
revision = "step-080"
tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
```
## Notes
- Architecture detected from the exported config: `LlamaForCausalLM`
- The original base model Hub ID is not encoded in these local checkpoints, so `base_model` metadata is not set automatically.
- Checkpoints were exported from verl FSDP shards into Hugging Face `safetensors` format.