Go to file

ModelHub XC 85eb2067a6 初始化项目，由ModelHub XC社区提供模型

Model: sethuiyer/Medichat-Llama3-8B
Source: Original Platform

2026-05-16 22:36:11 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

medichat_llam3.webp

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

mergekit_config.yml

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00001-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00002-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00003-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00004-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00005-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00006-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00007-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00008-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model-00009-of-00009.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-16 22:36:11 +08:00

README.md

base_model, library_name, tags, license, datasets, model-index, language

base_model

library_name

tags

license

datasets

model-index

language

Undi95/Llama-3-Unholy-8B

Locutusque/llama-3-neural-chat-v1-8b

ruslanmv/Medical-Llama3-8B-16bit

transformers

mergekit

merge

medical

other

mlabonne/orpo-dpo-mix-40k

Open-Orca/SlimOrca-Dedup

jondurbin/airoboros-3.2

microsoft/orca-math-word-problems-200k

m-a-p/Code-Feedback

MaziyarPanahi/WizardLM_evol_instruct_V2_196k

ruslanmv/ai-medical-chatbot

name

results

Medichat-Llama3-8B

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

AI2 Reasoning Challenge (25-Shot)

ai2_arc

ARC-Challenge

test

num_few_shot
25

type	value	name
acc_norm	59.13	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

split

args

HellaSwag (10-Shot)

hellaswag

validation

num_few_shot
10

type	value	name
acc_norm	82.9	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

MMLU (5-Shot)

cais/mmlu

all

test

num_few_shot
5

type	value	name
acc	60.35	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

TruthfulQA (0-shot)

truthful_qa

multiple_choice

validation

num_few_shot
0

type	value
mc2	49.65

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

Winogrande (5-shot)

winogrande

winogrande_xl

validation

num_few_shot
5

type	value	name
acc	78.93	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

GSM8k (5-shot)

gsm8k

main

test

num_few_shot
5

type	value	name
acc	60.35	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B	Open LLM Leaderboard

Medichat-Llama3-8B

Built upon the powerful LLaMa-3 architecture and fine-tuned on an extensive dataset of health information, this model leverages its vast medical knowledge to offer clear, comprehensive answers.

This model is generally better for accurate and informative responses, particularly for users seeking in-depth medical advice.

The following YAML configuration was used to produce this model:


models:
  - model: Undi95/Llama-3-Unholy-8B
    parameters:
      weight: [0.25, 0.35, 0.45, 0.35, 0.25]
      density: [0.1, 0.25, 0.5, 0.25, 0.1]
  - model: Locutusque/llama-3-neural-chat-v1-8b
  - model: ruslanmv/Medical-Llama3-8B-16bit
    parameters:
      weight: [0.55, 0.45, 0.35, 0.45, 0.55]
      density: [0.1, 0.25, 0.5, 0.25, 0.1]
merge_method: dare_ties
base_model: Locutusque/llama-3-neural-chat-v1-8b
parameters:
  int8_mask: true
dtype: bfloat16

Comparision Against Dr.Samantha 7B

Subject	Medichat-Llama3-8B Accuracy (%)	Dr. Samantha Accuracy (%)
Clinical Knowledge	71.70	52.83
Medical Genetics	78.00	49.00
Human Aging	70.40	58.29
Human Sexuality	73.28	55.73
College Medicine	62.43	38.73
Anatomy	64.44	41.48
College Biology	72.22	52.08
High School Biology	77.10	53.23
Professional Medicine	63.97	38.73
Nutrition	73.86	50.33
Professional Psychology	68.95	46.57
Virology	54.22	41.57
High School Psychology	83.67	66.60
Average	70.33	48.85

The current model demonstrates a substantial improvement over the previous Dr. Samantha model in terms of subject-specific knowledge and accuracy.

Usage:

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

class MedicalAssistant:
    def __init__(self, model_name="sethuiyer/Medichat-Llama3-8B", device="cuda"):
        self.device = device
        self.tokenizer = AutoTokenizer.from_pretrained(model_name)
        self.model = AutoModelForCausalLM.from_pretrained(model_name).to(self.device)
        self.sys_message = ''' 
        You are an AI Medical Assistant trained on a vast dataset of health information. Please be thorough and
        provide an informative answer. If you don't know the answer to a specific medical inquiry, advise seeking professional help.
        '''

    def format_prompt(self, question):
        messages = [
            {"role": "system", "content": self.sys_message},
            {"role": "user", "content": question}
        ]
        prompt = self.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
        return prompt

    def generate_response(self, question, max_new_tokens=512):
        prompt = self.format_prompt(question)
        inputs = self.tokenizer(prompt, return_tensors="pt").to(self.device)
        with torch.no_grad():
            outputs = self.model.generate(**inputs, max_new_tokens=max_new_tokens, use_cache=True)
        answer = self.tokenizer.batch_decode(outputs, skip_special_tokens=True)[0].strip()
        return answer

if __name__ == "__main__":
    assistant = MedicalAssistant()
    question = '''
    Symptoms:
    Dizziness, headache, and nausea.

    What is the differential diagnosis?
    '''
    response = assistant.generate_response(question)
    print(response)

Quants

Thanks to Quant Factory, the quantized version of this model is available at QuantFactory/Medichat-Llama3-8B-GGUF,

Ollama

This model is now also available on Ollama. You can use it by running the command ollama run monotykamary/medichat-llama3 in your terminal. If you have limited computing resources, check out this video to learn how to run it on a Google Colab backend.