Files
HuatuoGPT-3-8B/README.md
ModelHub XC fe4ebf92ce 初始化项目,由ModelHub XC社区提供模型
Model: FreedomIntelligence/HuatuoGPT-3-8B
Source: Original Platform
2026-06-06 18:18:12 +08:00

2.6 KiB

library_name, license, pipeline_tag, language, base_model, tags
library_name license pipeline_tag language base_model tags
transformers apache-2.0 text-generation
en
zh
Qwen/Qwen3-8B-Base
medical
reasoning

🩺 HuatuoGPT-3-8B

Introduction

HuatuoGPT-3 is an open-source medical LLM trained with SeedRL, an RL-only domain adaptation paradigm that transforms a base model into a medical expert in a single RL stage.

For more information, visit our GitHub repository: https://github.com/FreedomIntelligence/HuatuoGPT-3

Important

HuatuoGPT-3-8B is set to thinking mode by default. The output contains a <think>...</think> reasoning block followed by the final response after </think>.

Model Info

Model Description Backbone Link
HuatuoGPT-3-32B 32B medical LLM trained with SeedRL Qwen3-32B HF Link
HuatuoGPT-3-8B 8B medical LLM trained with SeedRL Qwen3-8B-Base HF Link
HuatuoGPT-3-7B-Pangu 7B medical LLM trained with SeedRL openPangu-Embedded-7B HF Link

Usage

You can use HuatuoGPT-3-8B in the same way as Qwen3-8B. You can deploy it with tools like vLLM or SGLang, or perform direct inference:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "FreedomIntelligence/HuatuoGPT-3-8B"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)

messages = [
    {"role": "user", "content": "What are the common causes of chest pain?"}
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

inputs = tokenizer([text], return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=4096)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

📖 Citation

@article{huatuogpt3,
  title={HuatuoGPT-3: RL-Only Domain Adaptation from Base Models via Off-Policy Seeding},
  author={Coming soon},
  journal={arXiv preprint},
  year={2026}
}