Llama3.2-3B-Explained-GGUF/README.md

---
base_model:
- theprint/Llama3.2-3B-Explained
tags:
- fine-tuned
- lora
- sft
- auto-sft
language:
- en
library_name: transformers
---

# Llama3.2-3B-Explained (GGUF)

A fine-tuned version of [`meta-llama/Llama-3.2-3B-Instruct`](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) trained on **Explained 0.41k alpaca** data using [Auto-SFT](https://github.com/your-org/auto-sft) — an automated hyperparameter search and supervised fine-tuning pipeline.

The base model was adapted to follow the style and content of the `Explained 0.41k alpaca` dataset. Expect improved performance on tasks similar to those represented in the training data.

## Model Details

| Property | Value |
|---|---|
| Base model | `meta-llama/Llama-3.2-3B-Instruct` |
| Training data | `data/Explained-0.41k-alpaca.json` |
| Fine-tuning epochs | 2 |
| Fine-tuning date | 2026-03-25 |
| Fine-tuning method | LoRA (merged to full 16-bit) |

## Training Hyperparameters

### LoRA

| Parameter | Value |
|---|---|
| `r` | `4` |
| `alpha` | `8` |
| `dropout` | `0.0` |
| `target_modules` | `['q_proj', 'v_proj', 'k_proj', 'o_proj']` |

### Training

| Parameter | Value |
|---|---|
| `learning_rate` | `1e-05` |
| `batch_size` | `1` |
| `gradient_accumulation_steps` | `2` |
| `warmup_ratio` | `0.0` |
| `max_seq_length` | `512` |

## GGUF Files

These quantized GGUF files can be used directly with [llama.cpp](https://github.com/ggerganov/llama.cpp), [Ollama](https://ollama.com/), [LM Studio](https://lmstudio.ai/), and other compatible runtimes.

| File | Description |
|---|---|
| `Llama3.2-3B-Explained-BF16.gguf` | BF16 |
| `Llama3.2-3B-Explained-Q8_0.gguf` | 8-bit — near-lossless, larger file |
| `Llama3.2-3B-Explained-Q6_K.gguf` | 6-bit — high quality |
| `Llama3.2-3B-Explained-Q5_K_M.gguf` | 5-bit medium — good quality/size balance |
| `Llama3.2-3B-Explained-Q5_K_S.gguf` | Q5_K_S |
| `Llama3.2-3B-Explained-Q4_K_M.gguf` | 4-bit medium — recommended for most use cases |
| `Llama3.2-3B-Explained-Q4_K_S.gguf` | Q4_K_S |
| `Llama3.2-3B-Explained-Q3_K_L.gguf` | Q3_K_L |
| `Llama3.2-3B-Explained-Q3_K_M.gguf` | Q3_K_M |
| `Llama3.2-3B-Explained-Q3_K_S.gguf` | Q3_K_S |
| `Llama3.2-3B-Explained-Q2_K.gguf` | 2-bit — smallest size, lowest quality |
| `Llama3.2-3B-Explained-IQ4_NL.gguf` | IQ4_NL |

---
*Generated by [Auto-SFT](https://github.com/your-org/auto-sft)*