初始化项目,由ModelHub XC社区提供模型

Model: duyntnet/Llama-3.2-3B-imatrix-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-21 09:27:18 +08:00
commit 7998ab4e55
29 changed files with 232 additions and 0 deletions

62
.gitattributes vendored Normal file
View File

@@ -0,0 +1,62 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.2-3B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text

3
Llama-3.2-3B-IQ1_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3572d3a3e5c3b05e649182239dc4a706c328f4ae199b0d3440b3cbffc9e845cb
size 924187712

3
Llama-3.2-3B-IQ1_S.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f437d143c111939c1c754568fe4bf466015bb17d3b7d497fece05dc7f5d3bcdd
size 868154432

3
Llama-3.2-3B-IQ2_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7c0d0ea08d623f47516aad4bb0d57b47ab39254fd64544ae5f41349ed8851c62
size 1229028416

3
Llama-3.2-3B-IQ2_S.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cf5d98d33839de9ba5cab6ba238c015817da9ca068bb85f14f4a1b15e73dbb3c
size 1154317376

3
Llama-3.2-3B-IQ2_XS.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a1f83c519043a27139aa1560d66c2d134794f7674f9e1e46daa8c9ab7bc660e4
size 1100545088

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0e701c7b616a3588da65714d0e76f90a4883738eaf604c7b3fd0a90aec1011b5
size 1017576512

3
Llama-3.2-3B-IQ3_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9f4957b4c090dec1024aa2e7de80e5a0bd65b2cb492a7842b30bf684faf31a50
size 1599665216

3
Llama-3.2-3B-IQ3_S.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e2727749e77be510e310e4197ee921794bc09049d5d8c11309b5aa003aed668b
size 1542845504

3
Llama-3.2-3B-IQ3_XS.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2c2a1462ad00942f28c771e033f74faa55f7c569b9dd35540fca180e5268ee7e
size 1476785216

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:66c85870e021b3decac7d090b5da7c2c6ea89cf71652bbdbc9fd4952b7319015
size 1348762688

3
Llama-3.2-3B-IQ4_NL.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4125969f47008ae2498b81accb2aba78b13f755d2f612dca875542da6d461d34
size 1917187136

3
Llama-3.2-3B-IQ4_XS.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:93b9ddf23edf476e76a3c84ba87c0a0af9fa6e8dfe46a810959ec6effecf8331
size 1829106752

3
Llama-3.2-3B-Q2_K.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:49efb58fd618af632b57bb9d666f3c867c744432696fdbf9e22a307116f2a05c
size 1363932224

3
Llama-3.2-3B-Q2_K_S.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dc0c23b0caeb76f178c043447f13f79ea2b6de4b955cf4efaafe9e47ed1e6c3f
size 1274278976

3
Llama-3.2-3B-Q3_K_L.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:442c1190a4ef5d876f934890ee9ddf436bed6b99cc63c2179314959c1328c820
size 1815344192

3
Llama-3.2-3B-Q3_K_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4e387756075d034b3f47a17d329cb29b5da09c1171932aaead9e272bcbcfcbaf
size 1687155776

3
Llama-3.2-3B-Q3_K_S.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ed8626590dc0a3a354b2a688b6c6c968ef1e59d5d00d3f2ae0cb536da1342445
size 1542845504

3
Llama-3.2-3B-Q4_0.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:785147311d5004b989d0dc0f2e62697941df1e971eae2a178ab25e2637cb47df
size 1921905728

3
Llama-3.2-3B-Q4_1.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:383782f6d99457530b862032b746560850dab26b959e25ae5c7a8fc55d978036
size 2093347904

3
Llama-3.2-3B-Q4_K_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ec6e0450368c0cc407fc0376c7a3c92fd525c91574318ddf32eb3c068b6f658e
size 2019374144

3
Llama-3.2-3B-Q4_K_S.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a06448b5aa5c4c5d555d59cd289fc85e8e96f6dd7d30643f0550c686b1a4b6b2
size 1928197184

3
Llama-3.2-3B-Q5_0.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1a68d3d6ea1654262b25360267f325e9a644ae2163fb664a0967bcfac05a7916
size 2274227264

3
Llama-3.2-3B-Q5_1.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8e6e336754b218b101cf6856643909efa6330458ebbe8f5c9add7f5a55a08500
size 2445669440

3
Llama-3.2-3B-Q5_K_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2e36251eb8846c6ff9d1b07845273be6422af5bc66d268a71f3a36fe3ce45461
size 2322150464

3
Llama-3.2-3B-Q5_K_S.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a551ac498f257ca0d5eaec355e7be5f1bac85b92847987ac174bb1b94e2ea022
size 2269508672

3
Llama-3.2-3B-Q6_K.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:44c2093345ecae8da4adfd44a172629e25fe66c09471fd90c7367e12c5135243
size 2643850304

3
Llama-3.2-3B-Q8_0.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:87068a3dca41ff849f2268d6bba5655c385c56a006e845c1dcd3a2ddf9aea0e8
size 3421895744

89
README.md Normal file
View File

@@ -0,0 +1,89 @@
---
license: other
language:
- en
pipeline_tag: text-generation
inference: false
tags:
- transformers
- gguf
- imatrix
- Llama-3.2-3B
---
Quantizations of https://huggingface.co/meta-llama/Llama-3.2-3B
### Inference Clients/UIs
* [llama.cpp](https://github.com/ggerganov/llama.cpp)
* [KoboldCPP](https://github.com/LostRuins/koboldcpp)
* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)
* [ollama](https://github.com/ollama/ollama)
---
# From original readme
The Meta Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out). The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks. They outperform many of the available open source and closed chat models on common industry benchmarks.
**Model Developer:** Meta
**Model Architecture:** Llama 3.2 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
| | Training Data | Params | Input modalities | Output modalities | Context Length | GQA | Shared Embeddings | Token count | Knowledge cutoff |
| :---- | :---- | :---- | :---- | :---- | :---- | :---- | :---- | :---- | :---- |
| Llama 3.2 (text only) | A new mix of publicly available online data. | 1B (1.23B) | Multilingual Text | Multilingual Text and code | 128k | Yes | Yes | Up to 9T tokens | December 2023 |
| | | 3B (3.21B) | Multilingual Text | Multilingual Text and code | | | | | |
**Supported Languages:** English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai are officially supported. Llama 3.2 has been trained on a broader collection of languages than these 8 supported languages. Developers may fine-tune Llama 3.2 models for languages beyond these supported languages, provided they comply with the Llama 3.2 Community License and the Acceptable Use Policy. Developers are always expected to ensure that their deployments, including those that involve additional languages, are completed safely and responsibly.
**Llama 3.2 Model Family:** Token counts refer to pretraining data only. All model versions use Grouped-Query Attention (GQA) for improved inference scalability.
**Model Release Date:** Sept 25, 2024
**Status:** This is a static model trained on an offline dataset. Future versions may be released that improve model capabilities and safety.
**License:** Use of Llama 3.2 is governed by the [Llama 3.2 Community License](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/LICENSE) (a custom, commercial license agreement).
**Feedback:** Where to send questions or comments about the model Instructions on how to provide feedback or comments on the model can be found in the model [README](https://github.com/meta-llama/llama-models/tree/main/models/llama3_2). For more technical information about generation parameters and recipes for how to use Llama 3.2 in applications, please go [here](https://github.com/meta-llama/llama-recipes).
## Intended Use
**Intended Use Cases:** Llama 3.2 is intended for commercial and research use in multiple languages. Instruction tuned text only models are intended for assistant-like chat and agentic applications like knowledge retrieval and summarization, mobile AI powered writing assistants and query and prompt rewriting. Pretrained models can be adapted for a variety of additional natural language generation tasks.
**Out of Scope:** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3.2 Community License. Use in languages beyond those explicitly referenced as supported in this model card.
## How to use
This repository contains two versions of Llama-3.2-3B, for use with transformers and with the original `llama` codebase.
### Use with transformers
Starting with transformers >= 4.43.0 onward, you can run conversational inference using the Transformers pipeline abstraction or by leveraging the Auto classes with the generate() function.
Make sure to update your transformers installation via pip install --upgrade transformers.
```python
import torch
from transformers import pipeline
model_id = "meta-llama/Llama-3.2-3B"
pipe = pipeline(
"text-generation",
model=model_id,
torch_dtype=torch.bfloat16,
device_map="auto"
)
pipe("The key to life is")
```
### Use with `llama`
Please, follow the instructions in the [repository](https://github.com/meta-llama/llama).
To download Original checkpoints, see the example command below leveraging `huggingface-cli`:
```
huggi