77 lines
3.1 KiB
Markdown
77 lines
3.1 KiB
Markdown
|
|
---
|
||
|
|
base_model:
|
||
|
|
- google/medgemma-27b-text-it
|
||
|
|
- unsloth/medgemma-27b-text-it
|
||
|
|
tags:
|
||
|
|
- text-generation-inference
|
||
|
|
- transformers
|
||
|
|
- unsloth
|
||
|
|
- gemma3_text
|
||
|
|
- trl
|
||
|
|
- llama.cpp
|
||
|
|
- mental-health
|
||
|
|
- psychology
|
||
|
|
license: apache-2.0
|
||
|
|
language:
|
||
|
|
- en
|
||
|
|
datasets:
|
||
|
|
- hllzmz/synthetic-mental-health-convos
|
||
|
|
---
|
||
|
|
|
||
|
|
# MedGemma-27B Mentalist
|
||
|
|
|
||
|
|
GGUF quantizations for `medgemma-mentalist`.
|
||
|
|
|
||
|
|
- Base model: `unsloth/medgemma-27b-text-it`
|
||
|
|
- LoRA adapter: https://huggingface.co/hllzmz/medgemma-mentalist
|
||
|
|
- GGUF repo: https://huggingface.co/hllzmz/medgemma-mentalist-gguf
|
||
|
|
|
||
|
|
## Files (GGUF)
|
||
|
|
- `medgemma-mentalist-q4_k_m.gguf`
|
||
|
|
- `medgemma-mentalist-q5_k_m.gguf`
|
||
|
|
- `medgemma-mentalist-q8_0.gguf`
|
||
|
|
|
||
|
|
### Which quant should I use?
|
||
|
|
- **q4_k_m**: lowest RAM/VRAM usage, fastest, lowest quality among these.
|
||
|
|
- **q5_k_m**: best quality/speed/memory balance for most users.
|
||
|
|
- **q8_0**: highest quality (closest to bfloat16), largest file, needs most RAM/VRAM.
|
||
|
|
|
||
|
|
**MedGemma Mentalist** is a specialized **Mental Health Assistant** model fine-tuned on top of Google's `medgemma-27b`. It has been trained using with high-quality synthetic client-therapist dialogues.
|
||
|
|
|
||
|
|
This model is designed to understand users' emotional states, interpret described experiences through the lens of mental health and clinical standards, and provide supportive guidance using a warm, empathetic tone.
|
||
|
|
|
||
|
|
## Critical Disclaimer
|
||
|
|
**This model is NOT a licensed medical professional.**
|
||
|
|
|
||
|
|
* It **cannot** provide definitive medical diagnoses.
|
||
|
|
* It **cannot** prescribe medication.
|
||
|
|
* It **cannot** replace emergency services in crisis situations (e.g., suicide, self-harm, harm to others).
|
||
|
|
* **It is intended solely for educational, research, and preliminary informational purposes.**
|
||
|
|
|
||
|
|
## Model Capabilities
|
||
|
|
* **Empathetic Dialogue:** Listens to the user without judgement and validates their feelings (Active Listening).
|
||
|
|
* **Symptom Analysis:** correlates user-described experiences with clinical terminology and criteria.
|
||
|
|
* **Safety First:** Prioritizes safety planning and refers users to professional help when risk signals are detected.
|
||
|
|
|
||
|
|
## How to Use
|
||
|
|
|
||
|
|
This quantized models can be run with llama-cli, LM Studio or something that able to run gguf models. For optimal performance and role adherence, please strictly follow the recommended prompts and parameters below.
|
||
|
|
|
||
|
|
### Recommended System Prompt
|
||
|
|
```
|
||
|
|
SYSTEM_PROMPT = """
|
||
|
|
You are MedGemma Mentalist, an advanced AI mental health assistant designed to provide empathetic support, scientifically grounded psychoeducation, and guidance.
|
||
|
|
Your goal is to be a bridge to professional help and a source of reliable mental health information.
|
||
|
|
You should NEVER diagnose or stigmatize the user directly.
|
||
|
|
"""
|
||
|
|
```
|
||
|
|
|
||
|
|
### Recommended Parameters
|
||
|
|
To prevent the model from hallucinating or accidentally roleplaying as the client (User), the following generation settings are highly recommended:
|
||
|
|
|
||
|
|
Temperature: 0.3 - 0.5 (Lower values ensure the model remains objective and grounded in mental health knowledge).
|
||
|
|
|
||
|
|
Repetition Penalty: 1.1 (Prevents the model from getting stuck in loops).
|
||
|
|
|
||
|
|
|
||
|
|
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|