Meta-Llama-3-8B-Instruct-fe…/README.md

---
library_name: transformers
pipeline_tag: text-generation
base_model:
- meta-llama/Meta-Llama-3-8B-Instruct
tags:
- peft
- lora
- merged
---
# Meta-Llama-3-8B-Instruct-fedavg-v0

This repository contains a full merged model produced by applying a PEFT LoRA adapter to its base model. It is intended for reproducible evaluation without requiring a separate adapter loading path.

## Quick Start

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "wvnvwn/Meta-Llama-3-8B-Instruct-fedavg-v0"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
```

Run the bundled FSL evaluation wrapper:

```bash
bash fsl/src/evaluation/overall_eval.sh \
    --model_name wvnvwn/Meta-Llama-3-8B-Instruct-fedavg-v0 \
    --model_type instruct \
    --max_samples 100 \
    --output_root fsl/results/evaluation/Meta-Llama-3-8B-Instruct-fedavg-v0
```

## Training Procedure

- Procedure: Federated LoRA fine-tuning followed by adapter aggregation. The resulting PEFT LoRA adapter was merged into the base model for reproducible evaluation.
- Algorithm: `fedavg`
- Training data: `data_hetero_with_4_tasks`
- Number of clients: `8`
- Communication round/checkpoint: `3`
- Local epochs: `3`
- Local batch size: `256`
- Local micro batch size: `16`
- Local learning rate: `0.0003`

Original training command:

Not specified

## Merge Metadata

- Base model: `meta-llama/Meta-Llama-3-8B-Instruct`
- Adapter source: `/NHNHOME/0226010080_A/BASE/jongbokwon/FLS/fsl/outputs/fedavg/8/2`
- PEFT type: `LORA`
- Task type: `CAUSAL_LM`
- LoRA rank: `16`
- LoRA alpha: `16`
- Target modules: `up_proj`, `v_proj`, `gate_proj`, `q_proj`, `k_proj`, `o_proj`, `down_proj`
- Merged at UTC: `2026-05-22T05:24:31.213169+00:00`

## Framework Versions

- python: `3.10.20`
- platform: `Linux-6.8.0-100-generic-x86_64-with-glibc2.39`
- torch: `2.11.0+cu130`
- cuda: `13.0`
- transformers: `4.57.1`
- peft: `0.19.1`
- accelerate: `1.13.0`
- huggingface_hub: `0.36.2`
- safetensors: `0.7.0`
- vllm: `0.20.2`
- wandb: `0.27.0`

## Notes

- This is a merged full-weight model, not an adapter-only checkpoint.
- Redistribution/access should follow the base model license and access policy.
- `merge_info.json` in this repository records the adapter and merge configuration.