Go to file

ModelHub XC ed39ba220f 初始化项目，由ModelHub XC社区提供模型

Model: Huggggooo/ProtoCycle-7B
Source: Original Platform

2026-04-25 16:24:05 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

added_tokens.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

chat_template.jinja

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

merges.txt

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

model-00001-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

model-00002-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

model-00003-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

model-00004-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

vocab.json

初始化项目，由ModelHub XC社区提供模型

2026-04-25 16:24:05 +08:00

README.md

license, library_name, pipeline_tag, base_model, tags, language

license

library_name

pipeline_tag

base_model

ProtoCycle-7B

RL checkpoint for ProtoCycle — an agentic protein design model that performs multi-step, tool-augmented sequence design.

This is the GRPO-TCR (Group Relative Policy Optimization with Tool-Call Reward) stage, initialised from the SFT checkpoint Huggggooo/ProtoCycle-7B-SFT.

Base model: Huggggooo/ProtoCycle-7B-SFT (itself fine-tuned from Qwen/Qwen2.5-7B-Instruct)
Training framework: VeRL / Open-AgentRL
Stage: agentic RL with GRPO-TCR
Rollouts per prompt: 8, max turns: 16
Max prompt / response: 8k / 20k tokens
Reward manager: protein (see ProtoCycle/verl/workers/reward_manager/protein.py)

See recipe/protein/reward.py for the exact formulation.

Training Data

10,000 RL prompts for GRPO-TCR training, available at Huggggooo/ProtoCycle-Data (rl/ subset).}

Agent Protocol

<think>  ... reasoning ...  </think>
<plan>   ... stage plan ...  </plan>
<tool_call>{"name": "...", "arguments": {...}}</tool_call>
...
<answer>MAEGEITPLKTF...</answer>

How to Use

See the ProtoCycle repository: ProtoCycle repo.

License

Apache-2.0.

Citation

If you find this work useful, please cite ProtoCycle (forthcoming) and the upstream frameworks: VeRL, Open-AgentRL, ProTrek, ESM.