初始化项目,由ModelHub XC社区提供模型

Model: hyunseoki/verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-24 14:49:23 +08:00
commit 0b1a40f839
13 changed files with 2644 additions and 0 deletions

51
README.md Normal file
View File

@@ -0,0 +1,51 @@
---
library_name: transformers
pipeline_tag: text-generation
tags:
- verl
- math
- grpo
- transfer
- llama
- llama-3.1
- llama-3.2
- 8b
- 3b
- pool7to1
---
# VERL Math Transfer Llama 3.1 8B to Llama 3.2 3B pool7to1
Math transfer experiment trained with verl. This repo groups all exported Hugging Face checkpoints for the Llama 3.1 8B to Llama 3.2 3B pool7to1 configuration.
## Layout
- `main`: latest exported checkpoint, currently `step-080`
- step revisions: `step-010, step-020, step-030, step-040, step-050, step-060, step-070, step-080`
## Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
repo_id = "hyunseoki/verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1"
tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
```
Load a specific checkpoint revision:
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
repo_id = "hyunseoki/verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1"
revision = "step-080"
tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, revision=revision, trust_remote_code=True)
```
## Notes
- Architecture detected from the exported config: `LlamaForCausalLM`
- The original base model Hub ID is not encoded in these local checkpoints, so `base_model` metadata is not set automatically.
- Checkpoints were exported from verl FSDP shards into Hugging Face `safetensors` format.