初始化项目,由ModelHub XC社区提供模型
Model: luizaaca/qwen3-0.6b-clinical-screening Source: Original Platform
This commit is contained in:
126
README.md
Normal file
126
README.md
Normal file
@@ -0,0 +1,126 @@
|
||||
---
|
||||
license: cc-by-4.0
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
pipeline_tag: text-generation
|
||||
base_model: Qwen/Qwen3-0.6B
|
||||
base_model_relation: finetune
|
||||
datasets:
|
||||
- dhivyeshrk/diseases-and-symptoms-dataset
|
||||
- niyarrbarman/symptom2disease
|
||||
tags:
|
||||
- medical
|
||||
- clinical-screening
|
||||
- symptom-to-disease
|
||||
- disease-classification
|
||||
- lora
|
||||
- qlora
|
||||
- gguf
|
||||
- unsloth
|
||||
- peft
|
||||
- transformers
|
||||
- bertscore
|
||||
---
|
||||
# Qwen3-0.6B Clinical Screening
|
||||
|
||||
This repository packages the artifacts generated by the training notebook
|
||||
[`screening_robot.ipynb`](https://github.com/luizaaca/screening_robot/blob/main/screening_robot.ipynb) for a compact clinical screening assistant based on
|
||||
Qwen3-0.6B.
|
||||
|
||||
## Artifact layout
|
||||
|
||||
- Root-level merged model files (`model.safetensors`, `config.json`,
|
||||
`tokenizer.json`, `tokenizer_config.json`, `chat_template.jinja`) for
|
||||
direct `transformers` loading.
|
||||
- `lora/`: PEFT LoRA adapter weights and tokenizer/chat-template files.
|
||||
- `gguf/qwen3-0.6b-clinical-screening.Q4_K_M.gguf`: quantized GGUF export (`Q4_K_M`) for
|
||||
llama.cpp, Ollama, LM Studio, and similar runtimes.
|
||||
- `Modelfile`: Ollama-oriented prompt wrapper aligned with the training
|
||||
prompt and inference settings used in the notebook examples.
|
||||
|
||||
## Model details
|
||||
|
||||
- **Repository**: `https://huggingface.co/luizaaca/qwen3-0.6b-clinical-screening`
|
||||
- **Base model**: `Qwen/Qwen3-0.6B`
|
||||
- **Training runtime base**: `unsloth/Qwen3-0.6B-unsloth-bnb-4bit`
|
||||
- **Training recipe**: QLoRA via Unsloth on a 4-bit loaded base model
|
||||
- **LoRA hyperparameters**: rank 16, alpha 32, dropout 0.0
|
||||
- **Target modules**: `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`,
|
||||
`up_proj`, `down_proj`
|
||||
- **Max sequence length used during training**: 1024
|
||||
- **Training steps**: 400
|
||||
- **Target hardware**: Google Colab Free with Tesla T4 (16 GB VRAM)
|
||||
|
||||
## Training data
|
||||
|
||||
The checked-in notebook trains on two Kaggle datasets:
|
||||
|
||||
- `dhivyeshrk/diseases-and-symptoms-dataset`: binary symptom-matrix data converted to natural-language
|
||||
symptom lists, with a stratified cap of 50 examples per disease before the
|
||||
train/test split.
|
||||
- `niyarrbarman/symptom2disease`: free-text symptom descriptions mapped directly to disease labels.
|
||||
|
||||
## Output contract
|
||||
|
||||
The assistant is fine-tuned to answer using the standardized disclaimer format:
|
||||
|
||||
```text
|
||||
Based on the reported symptoms, the clinical indication points to: <disease>.
|
||||
Disclaimer: This is an AI auxiliary tool designed for healthcare professionals. It is not 100% precise and does not replace a professional medical diagnosis.
|
||||
```
|
||||
|
||||
This is a plain-text disease-identification assistant. Unlike the 1.7B
|
||||
JSON specialist model, this 0.6B notebook does **not** train on a structured
|
||||
JSON schema.
|
||||
|
||||
## Prompting notes
|
||||
|
||||
The training and inference flow uses a system prompt that frames the model as a
|
||||
clinical AI assistant and user prompts shaped like:
|
||||
|
||||
```text
|
||||
Given the symptoms reported, identify the disease.
|
||||
|
||||
Symptoms: ... /no_think
|
||||
```
|
||||
|
||||
The notebook markdown discusses mixed thinking/no-thinking supervision, but the
|
||||
effective checked-in configuration sets `thinking_ratio = 0`, so the actual
|
||||
supervised examples follow the no-thinking path.
|
||||
|
||||
## Validation summary
|
||||
|
||||
The notebook evaluates the base model against the fine-tuned model on held-out
|
||||
splits from both datasets and reports:
|
||||
|
||||
- qualitative side-by-side generations;
|
||||
- Accuracy and macro-F1;
|
||||
- Cohen's Kappa;
|
||||
- row-normalized confusion matrices; and
|
||||
- BERTScore.
|
||||
|
||||
The code also asserts that fine-tuned accuracy on Dataset 2 matches or exceeds
|
||||
the base model.
|
||||
|
||||
## Intended use
|
||||
|
||||
This repository is suitable for:
|
||||
|
||||
- research experiments on lightweight clinical-screening assistants;
|
||||
- teaching and prototyping around symptom-to-disease prompting; and
|
||||
- local inference with Transformers, PEFT, GGUF-compatible runtimes, or Ollama.
|
||||
|
||||
## Out-of-scope use
|
||||
|
||||
This repository is **not** intended for:
|
||||
|
||||
- autonomous diagnosis or treatment decisions;
|
||||
- emergency triage without clinician oversight;
|
||||
- prescribing or medication guidance; or
|
||||
- use as a substitute for professional medical judgment.
|
||||
|
||||
## Safety notice
|
||||
|
||||
This is a research artifact for healthcare-support workflows only. Always keep a
|
||||
qualified human clinician in the loop.
|
||||
Reference in New Issue
Block a user