Base Model: meta-llama/Llama-3.2-1B-Instruct
Dataset: OpenMed/Medical-Reasoning-SFT-MiniMax-M2.1
Model Overview
LlamaTron RS1 Nemesis is a medical reasoning model produced by fine-tuning meta-llama/Llama-3.2-1B-Instruct on the Medical-Reasoning-SFT-MiniMax-M2.1 dataset using QLoRA. The dataset contains 204,773 clinical reasoning conversations with full chain-of-thought traces covering differential diagnosis, treatment planning, pharmacology, and clinical case analysis.
Despite being a 1 billion parameter model, it handles complex clinical questions with structured and coherent reasoning.
Differential diagnosis, treatment planning, pharmacology, clinical case analysis
How to Use
Load the Model
importtorchfromtransformersimportAutoModelForCausalLM,AutoTokenizer,pipelinemodel_id="Rumiii/LlamaTron_RS1_Nemesis_1B"tokenizer=AutoTokenizer.from_pretrained(model_id)model=AutoModelForCausalLM.from_pretrained(model_id,torch_dtype=torch.bfloat16,device_map="auto",)pipe=pipeline("text-generation",model=model,tokenizer=tokenizer)messages=[{"role":"system","content":"You are LlamaTron RS1 Nemesis, a knowledgeable and compassionate medical AI assistant. Provide accurate, evidence-based medical information clearly and helpfully."},{"role":"user","content":"What are the early symptoms of Type 2 Diabetes?"},]output=pipe(messages,max_new_tokens=400,do_sample=True,temperature=0.7,top_p=0.9,)print(output[0]["generated_text"][-1]["content"])