medgemma-mentalist-gguf/README.md

---
base_model:
- google/medgemma-27b-text-it
- unsloth/medgemma-27b-text-it
tags:
- text-generation-inference
- transformers
- unsloth
- gemma3_text
- trl
- llama.cpp
- mental-health
- psychology
license: apache-2.0
language:
- en
datasets:
- hllzmz/synthetic-mental-health-convos
---

# MedGemma-27B Mentalist

GGUF quantizations for `medgemma-mentalist`.

- Base model: `unsloth/medgemma-27b-text-it`
- LoRA adapter: https://huggingface.co/hllzmz/medgemma-mentalist
- GGUF repo: https://huggingface.co/hllzmz/medgemma-mentalist-gguf

## Files (GGUF)
- `medgemma-mentalist-q4_k_m.gguf`
- `medgemma-mentalist-q5_k_m.gguf`
- `medgemma-mentalist-q8_0.gguf`

### Which quant should I use?
- **q4_k_m**: lowest RAM/VRAM usage, fastest, lowest quality among these.
- **q5_k_m**: best quality/speed/memory balance for most users.
- **q8_0**: highest quality (closest to bfloat16), largest file, needs most RAM/VRAM.

**MedGemma Mentalist** is a specialized **Mental Health Assistant** model fine-tuned on top of Google's `medgemma-27b`. It has been trained using with high-quality synthetic client-therapist dialogues.

This model is designed to understand users' emotional states, interpret described experiences through the lens of mental health and clinical standards, and provide supportive guidance using a warm, empathetic tone.

## Critical Disclaimer
**This model is NOT a licensed medical professional.**

* It **cannot** provide definitive medical diagnoses.
* It **cannot** prescribe medication.
* It **cannot** replace emergency services in crisis situations (e.g., suicide, self-harm, harm to others).
* **It is intended solely for educational, research, and preliminary informational purposes.**

## Model Capabilities
* **Empathetic Dialogue:** Listens to the user without judgement and validates their feelings (Active Listening).
* **Symptom Analysis:** correlates user-described experiences with clinical terminology and criteria.
* **Safety First:** Prioritizes safety planning and refers users to professional help when risk signals are detected.

## How to Use

This quantized models can be run with llama-cli, LM Studio or something that able to run gguf models. For optimal performance and role adherence, please strictly follow the recommended prompts and parameters below.

### Recommended System Prompt 
```
SYSTEM_PROMPT = """
    You are MedGemma Mentalist, an advanced AI mental health assistant designed to provide empathetic support, scientifically grounded psychoeducation, and guidance.
    Your goal is to be a bridge to professional help and a source of reliable mental health information.
    You should NEVER diagnose or stigmatize the user directly. 
    """
```

### Recommended Parameters
To prevent the model from hallucinating or accidentally roleplaying as the client (User), the following generation settings are highly recommended:

Temperature: 0.3 - 0.5 (Lower values ensure the model remains objective and grounded in mental health knowledge).

Repetition Penalty: 1.1 (Prevents the model from getting stuck in loops).


[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
初始化项目，由ModelHub XC社区提供模型 Model: hllzmz/medgemma-mentalist-gguf Source: Original Platform 2026-05-14 00:52:36 +08:00			`---`
			`base_model:`
			`- google/medgemma-27b-text-it`
			`- unsloth/medgemma-27b-text-it`
			`tags:`
			`- text-generation-inference`
			`- transformers`
			`- unsloth`
			`- gemma3_text`
			`- trl`
			`- llama.cpp`
			`- mental-health`
			`- psychology`
			`license: apache-2.0`
			`language:`
			`- en`
			`datasets:`
			`- hllzmz/synthetic-mental-health-convos`
			`---`

			`# MedGemma-27B Mentalist`

			GGUF quantizations for `medgemma-mentalist`.

			- Base model: `unsloth/medgemma-27b-text-it`
			`- LoRA adapter: https://huggingface.co/hllzmz/medgemma-mentalist`
			`- GGUF repo: https://huggingface.co/hllzmz/medgemma-mentalist-gguf`

			`## Files (GGUF)`
			- `medgemma-mentalist-q4_k_m.gguf`
			- `medgemma-mentalist-q5_k_m.gguf`
			- `medgemma-mentalist-q8_0.gguf`

			`### Which quant should I use?`
			`- q4_k_m: lowest RAM/VRAM usage, fastest, lowest quality among these.`
			`- q5_k_m: best quality/speed/memory balance for most users.`
			`- q8_0: highest quality (closest to bfloat16), largest file, needs most RAM/VRAM.`

			MedGemma Mentalist is a specialized Mental Health Assistant model fine-tuned on top of Google's `medgemma-27b`. It has been trained using with high-quality synthetic client-therapist dialogues.

			`This model is designed to understand users' emotional states, interpret described experiences through the lens of mental health and clinical standards, and provide supportive guidance using a warm, empathetic tone.`

			`## Critical Disclaimer`
			`This model is NOT a licensed medical professional.`

			`* It cannot provide definitive medical diagnoses.`
			`* It cannot prescribe medication.`
			`* It cannot replace emergency services in crisis situations (e.g., suicide, self-harm, harm to others).`
			`* It is intended solely for educational, research, and preliminary informational purposes.`

			`## Model Capabilities`
			`* Empathetic Dialogue: Listens to the user without judgement and validates their feelings (Active Listening).`
			`* Symptom Analysis: correlates user-described experiences with clinical terminology and criteria.`
			`* Safety First: Prioritizes safety planning and refers users to professional help when risk signals are detected.`

			`## How to Use`

			`This quantized models can be run with llama-cli, LM Studio or something that able to run gguf models. For optimal performance and role adherence, please strictly follow the recommended prompts and parameters below.`

			`### Recommended System Prompt`
			```
			`SYSTEM_PROMPT = """`
			`You are MedGemma Mentalist, an advanced AI mental health assistant designed to provide empathetic support, scientifically grounded psychoeducation, and guidance.`
			`Your goal is to be a bridge to professional help and a source of reliable mental health information.`
			`You should NEVER diagnose or stigmatize the user directly.`
			`"""`
			```

			`### Recommended Parameters`
			`To prevent the model from hallucinating or accidentally roleplaying as the client (User), the following generation settings are highly recommended:`

			`Temperature: 0.3 - 0.5 (Lower values ensure the model remains objective and grounded in mental health knowledge).`

			`Repetition Penalty: 1.1 (Prevents the model from getting stuck in loops).`


			`[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)`