Go to file

ModelHub XC 47d5a015ff 初始化项目，由ModelHub XC社区提供模型

Model: suayptalha/Qwen3-0.6B-Treatment
Source: Original Platform

2026-06-03 06:46:14 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

added_tokens.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

configuration.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

merges.txt

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

model.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

vocab.json

初始化项目，由ModelHub XC社区提供模型

2026-06-03 06:46:14 +08:00

README.md

license, tags, datasets, language, base_model, pipeline_tag, library_name

license

Qwen3-0.6B-Treatment-Expert

This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its clinical treatment planning and reasoning capabilities. The model was optimized using the bfloat16 (bf16) data type.

Training Procedure

Dataset Preparation
- Dataset: Containing paired clinical diagnosis descriptions and corresponding step-by-step treatment plans.
Model Loading and Configuration
- Base model: Qwen3-0.6B, loaded with the unsloth library in bf16 precision.
- Full fine-tuning (full_finetuning=True) applied to all layers to adapt the model for medical treatment tasks.
Supervised Fine-Tuning (SFT)
- Utilized the Hugging Face TRL library with the Supervised Fine-Tuning approach.
- The model was trained to generate both intermediate reasoning steps and final treatment recommendations.
- Training hyperparameters:
  - Epochs: 2
  - Learning rate: 2e-5
  - Batch size: 8

Purpose and Outcome

Significantly improved the model’s ability to interpret clinical diagnoses and propose structured treatment plans.

Evaluation

Performance was measured on a held-out validation set with the following metrics:
- Plan Fidelity: 59.69% similarity with DeepSeek V3-0324.
- Reasoning Coherence: Rated high by a panel of medical experts.

License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

README.md Unescape Escape

Qwen3-0.6B-Treatment-Expert

Training Procedure

Purpose and Outcome

Evaluation

License

README.md