Files
MMedS-Llama-3-8B/README.md
ModelHub XC 6ad415b716 初始化项目,由ModelHub XC社区提供模型
Model: Henrychur/MMedS-Llama-3-8B
Source: Original Platform
2026-06-23 10:54:18 +08:00

37 lines
1.4 KiB
Markdown
Raw Permalink Blame History

This file contains invisible Unicode characters

This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
license: llama3
datasets:
- Henrychur/MMedC
- Henrychur/MedS-Ins
language:
- en
base_model: Henrychur/MMedS-Llama-3-8B
tags:
- medical
library_name: transformers
---
# MMedS-Llama3
[💻Github Repo](https://github.com/MAGIC-AI4Med/MedS-Ins) [🖨arXiv Paper](https://arxiv.org/abs/2408.12547)
The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"
## Introduction
This repository hosts MMedS-Llama-3-8B. Its foundation model, [MMed-Llama-3-8B](https://huggingface.co/Henrychur/MMed-Llama-3-8B),
is a multilingual medical language model which has undergone additional continuous pretraining on MMedC. Furthermore, the model has
been fine-tuned under supervision using MedS-Ins, a comprehensive dataset designed specifically for supervised fine-tuning (SFT),
featuring 13.5 million samples across 122 tasks. For more details, please refer to our paper.
## Usage
The model can be loaded as follows:
```py
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Henrychur/MMed-Llama-3-8B-EnIns")
model = AutoModelForCausalLM.from_pretrained("Henrychur/MMed-Llama-3-8B-EnIns", torch_dtype=torch.float16)
```
- Inference format is the same as Llama 3, you can check the inference code [here](https://github.com/MAGIC-AI4Med/MedS-Ins/blob/main/Inference/model.py).