ModelHub XC 30256fefe0 Initialize project; model provided by the ModelHub XC community
Model: pratinavseth/cricket-captain-qwen3-06b-merged
Source: Original Platform
2026-05-02 05:10:32 +08:00


base_model: Qwen/Qwen3-0.6B
library_name: transformers
tags: generated_from_trainer, trl, grpo, cricket, merged
license: mit
pipeline_tag: text-generation

cricket-captain-qwen3-06b-merged

Qwen/Qwen3-0.6B with the pratinavseth/cricket-captain-qwen3-06b-stage2 LoRA adapter (stage 2 GRPO, step 50) merged into the base weights. The result is a single-file model that loads without PEFT and can be used directly with transformers, vLLM, or TGI.
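For readers unfamiliar with what "merged into the base weights" means, here is a minimal sketch of the standard LoRA merge, W' = W + (alpha / r) * (B @ A), on a toy 2x2 layer. It is not the procedure used to build this checkpoint; the matrices and the `alpha` and `r` values are illustrative assumptions, not the adapter's actual configuration.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain-Python matrix product, enough for a toy example."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(w: Matrix, lora_a: Matrix, lora_b: Matrix, alpha: float, r: int) -> Matrix:
    """Fold a LoRA update into the base weight: W' = W + (alpha / r) * (B @ A)."""
    scale = alpha / r
    delta = matmul(lora_b, lora_a)  # shape (out, in), same as w
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


# Toy values (assumed for illustration): out=2, in=2, rank r=1.
w = [[1.0, 0.0], [0.0, 1.0]]   # base weight, shape (out, in)
lora_a = [[1.0, 2.0]]          # LoRA A, shape (r, in)
lora_b = [[0.5], [-1.0]]       # LoRA B, shape (out, r)

merged = merge_lora(w, lora_a, lora_b, alpha=2.0, r=1)

# The merged layer gives the same output as base path + scaled adapter path.
x = [[1.0], [1.0]]
y_merged = matmul(merged, x)
y_base = matmul(w, x)
y_adapter = matmul(lora_b, matmul(lora_a, x))
y_unmerged = [[b[0] + 2.0 * a[0]] for b, a in zip(y_base, y_adapter)]
```

Because the adapter's contribution is folded into `merged`, inference needs only the single weight matrix, which is why a merged checkpoint loads without PEFT.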

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

# Merged checkpoint: loads like any causal LM, with no PEFT adapter step.
repo_id = "pratinavseth/cricket-captain-qwen3-06b-merged"
tok = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, torch_dtype="bfloat16", device_map="auto")

The model expects the cricket-captain prompt schema produced by the OpenEnv environment in this repo (see inference.py for prompt construction).
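As a rough sketch of how generation might look once a prompt has been built, the helper below tokenizes a prompt string, calls `generate`, and returns only the newly generated text. The function names `strip_prompt` and `generate_completion` and the `max_new_tokens` default are hypothetical, and the sketch assumes the prompt is passed as a plain string; the actual prompt construction lives in inference.py and is not reproduced here.

```python
from typing import List

REPO_ID = "pratinavseth/cricket-captain-qwen3-06b-merged"


def strip_prompt(output_ids: List[int], prompt_len: int) -> List[int]:
    """Drop the echoed prompt tokens; a causal LM's generate() returns prompt + new tokens."""
    return output_ids[prompt_len:]


def generate_completion(prompt: str, max_new_tokens: int = 64) -> str:
    """Hypothetical helper: run the merged model on an already-built prompt string."""
    # Heavy import kept local so strip_prompt stays usable without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained(REPO_ID)
    model = AutoModelForCausalLM.from_pretrained(
        REPO_ID, torch_dtype="bfloat16", device_map="auto"
    )
    inputs = tok(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    prompt_len = inputs["input_ids"].shape[1]
    new_ids = strip_prompt(output[0].tolist(), prompt_len)
    return tok.decode(new_ids, skip_special_tokens=True)
```

Calling `generate_completion(prompt)` downloads the checkpoint on first use. The `prompt` argument should be a string produced by the prompt construction in inference.py, since output quality on other prompt formats is not described by this README.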