Initialize project; model provided by the ModelHub XC community

Model: RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf
Source: Original Platform
ModelHub XC
2026-04-21 17:55:36 +08:00
commit 5e7ba7f537
24 changed files with 236 additions and 0 deletions

.gitattributes

@@ -0,0 +1,57 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
llama3-1_8b_oh_v3.1_wo_share_gpt.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
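
The attribute list above has no `*.gguf` wildcard; each GGUF file is tracked by its exact name. The sketch below approximates that behaviour in Python with a handful of the patterns. It is an illustration only: `is_lfs_tracked` is a hypothetical helper, and `fnmatchcase` only roughly mirrors Git's own pattern matching (it ignores directory rules such as `saved_model/**/*`).

```python
from fnmatch import fnmatchcase

# A few patterns copied from the .gitattributes above (not the full list).
LFS_PATTERNS = [
    "*.bin",
    "*.safetensors",
    "*.tar.*",
    "*tfevents*",
    "llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_M.gguf",
]


def is_lfs_tracked(filename, patterns=LFS_PATTERNS):
    """Approximate check: does the bare file name match any listed pattern?"""
    return any(fnmatchcase(filename, pattern) for pattern in patterns)


print(is_lfs_tracked("llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_M.gguf"))  # True
print(is_lfs_tracked("some_other_model.Q4_K_M.gguf"))  # False: no *.gguf wildcard
print(is_lfs_tracked("README.md"))  # False
```

Under this reading, a GGUF file added later with a new name would not be routed through LFS unless a matching line is also added.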

README.md

@@ -0,0 +1,113 @@
Quantization made by Richard Erkhov.
[Github](https://github.com/RichardErkhov)
[Discord](https://discord.gg/pvy7H8DZMG)
[Request more models](https://github.com/RichardErkhov/quant_request)
llama3-1_8b_oh_v3.1_wo_share_gpt - GGUF
- Model creator: https://huggingface.co/mlfoundations-dev/
- Original model: https://huggingface.co/mlfoundations-dev/llama3-1_8b_oh_v3.1_wo_share_gpt/
| Name | Quant method | Size |
| ---- | ---- | ---- |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q2_K.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q2_K.gguf) | Q2_K | 2.96GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_XS.gguf) | IQ3_XS | 3.28GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_S.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_S.gguf) | IQ3_S | 3.43GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_S.gguf) | Q3_K_S | 3.41GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_M.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.IQ3_M.gguf) | IQ3_M | 3.52GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K.gguf) | Q3_K | 3.74GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_M.gguf) | Q3_K_M | 3.74GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q3_K_L.gguf) | Q3_K_L | 4.03GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.IQ4_XS.gguf) | IQ4_XS | 4.18GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_0.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_0.gguf) | Q4_0 | 4.34GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.IQ4_NL.gguf) | IQ4_NL | 4.38GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_S.gguf) | Q4_K_S | 4.37GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K.gguf) | Q4_K | 4.58GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_K_M.gguf) | Q4_K_M | 4.58GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_1.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q4_1.gguf) | Q4_1 | 4.78GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_0.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_0.gguf) | Q5_0 | 5.21GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K_S.gguf) | Q5_K_S | 5.21GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K.gguf) | Q5_K | 5.34GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_K_M.gguf) | Q5_K_M | 5.34GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_1.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q5_1.gguf) | Q5_1 | 5.65GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q6_K.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q6_K.gguf) | Q6_K | 6.14GB |
| [llama3-1_8b_oh_v3.1_wo_share_gpt.Q8_0.gguf](https://huggingface.co/RichardErkhov/mlfoundations-dev_-_llama3-1_8b_oh_v3.1_wo_share_gpt-gguf/blob/main/llama3-1_8b_oh_v3.1_wo_share_gpt.Q8_0.gguf) | Q8_0 | 7.95GB |
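
One way to use the table is to pick the largest quantization that fits a given size budget. The following is a minimal sketch, not part of the original card: the sizes are copied from the table, `largest_quant_within` and `gguf_filename` are hypothetical helpers, and the file size is only a lower bound on the memory needed to run the model.

```python
# Sizes in GB as listed in the table above.
QUANT_SIZES_GB = {
    "Q2_K": 2.96, "IQ3_XS": 3.28, "IQ3_S": 3.43, "Q3_K_S": 3.41,
    "IQ3_M": 3.52, "Q3_K": 3.74, "Q3_K_M": 3.74, "Q3_K_L": 4.03,
    "IQ4_XS": 4.18, "Q4_0": 4.34, "IQ4_NL": 4.38, "Q4_K_S": 4.37,
    "Q4_K": 4.58, "Q4_K_M": 4.58, "Q4_1": 4.78, "Q5_0": 5.21,
    "Q5_K_S": 5.21, "Q5_K": 5.34, "Q5_K_M": 5.34, "Q5_1": 5.65,
    "Q6_K": 6.14, "Q8_0": 7.95,
}


def largest_quant_within(budget_gb, sizes=QUANT_SIZES_GB):
    """Return the name of the largest quant whose file fits the budget, or None."""
    fitting = {name: size for name, size in sizes.items() if size <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def gguf_filename(quant):
    """Build the file name following the naming pattern used in the table."""
    return f"llama3-1_8b_oh_v3.1_wo_share_gpt.{quant}.gguf"


choice = largest_quant_within(5.0)
print(choice, gguf_filename(choice))
```

With a 5.0 GB budget this selects `Q4_1`; when two quants have the same listed size, the one that appears first in the dictionary wins.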
Original model description:
---
library_name: transformers
license: llama3.1
base_model: meta-llama/Meta-Llama-3.1-8B
tags:
- llama-factory
- full
- generated_from_trainer
model-index:
- name: llama3-1_8b_oh_v3.1_wo_share_gpt
  results: []
---
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->
# llama3-1_8b_oh_v3.1_wo_share_gpt
This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) on the mlfoundations-dev/oh_v3.1_wo_share_gpt dataset.
It achieves the following results on the evaluation set:
- Loss: 0.6453
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 5e-06
- train_batch_size: 16
- eval_batch_size: 8
- seed: 42
- distributed_type: multi-GPU
- num_devices: 32
- total_train_batch_size: 512
- total_eval_batch_size: 256
- optimizer: AdamW (OptimizerNames.ADAMW_TORCH) with betas=(0.9,0.999), epsilon=1e-08, and no additional optimizer arguments
- lr_scheduler_type: constant
- lr_scheduler_warmup_ratio: 0.1
- lr_scheduler_warmup_steps: 1738
- num_epochs: 3.0
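
The batch-size figures in the list above can be cross-checked with a little arithmetic. The sketch below is an illustration only: the card does not list a gradient accumulation setting, so the value computed here is an inference from the reported totals, not a documented hyperparameter.

```python
# Values copied from the hyperparameter list above.
per_device_train_batch_size = 16
per_device_eval_batch_size = 8
num_devices = 32
total_train_batch_size = 512
total_eval_batch_size = 256

# Inferred, not stated in the card: accumulation steps implied by the totals.
implied_grad_accum = total_train_batch_size // (
    per_device_train_batch_size * num_devices
)
eval_total = per_device_eval_batch_size * num_devices

print(implied_grad_accum)  # 1
print(eval_total == total_eval_batch_size)  # True
```

Both totals equal the per-device batch size multiplied by the 32 devices, which is consistent with no gradient accumulation.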
### Training results
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 0.6517 | 1.0 | 422 | 0.6541 |
| 0.6045 | 2.0 | 844 | 0.6440 |
| 0.5731 | 3.0 | 1266 | 0.6453 |
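
The table above can be read programmatically to see where validation loss bottoms out. This is a small sketch over the three reported rows; the variable names are illustrative and nothing here comes from the training code itself.

```python
# Rows copied from the table: (training loss, epoch, step, validation loss).
results = [
    (0.6517, 1.0, 422, 0.6541),
    (0.6045, 2.0, 844, 0.6440),
    (0.5731, 3.0, 1266, 0.6453),
]

# Row with the lowest validation loss.
best = min(results, key=lambda row: row[3])

# Gap between validation and training loss at each epoch.
gaps = [round(val - train, 4) for train, _, _, val in results]

print(best[1], best[2])  # 2.0 844
print(gaps)
```

Validation loss is lowest at epoch 2 (0.6440) and rises slightly by epoch 3 (0.6453, the figure reported at the top of the card), while training loss keeps falling, so the gap between the two widens each epoch.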
### Framework versions
- Transformers 4.46.1
- Pytorch 2.4.0
- Datasets 3.0.2
- Tokenizers 0.20.3


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f45466539868d4312f29c4a67a7b22933b3f3db31c51e180f4c54fc7955b831a
size 3784824448


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7426dd002fcb5795d080ac853c1eb1aada5ad4d69491ebb2f82423b2135b5d12
size 3682326144


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:636e2b703abd93ab5b0867b4e2fff06e0ebbf33f677daa904dae17cce96bbf40
size 3518748288


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:11e0cf2d44a36fbccd268c1525103a67748c458c8f4640f8a3566da2bc7b6d97
size 4707350144


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7c542c3b200d7f9105248af0a6442152ba9cc127485a61d583362b71723d77d0
size 4484363904


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4cb10aadf19018fd44251ad8e29e1d30fff2f53a0a374f6da59846e9134703aa
size 3179132544


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:aa2bf742bd4c0cbd245922346bbf18ea872025222843bff276620334e65098df
size 4018919040


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:44aeb6597409b9e6b6adeb1906ce9ae8baeefab65a26d97127c37a96a204ee9c
size 4321957504


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:aa2bf742bd4c0cbd245922346bbf18ea872025222843bff276620334e65098df
size 4018919040


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:50d2057cff174ac49e3fb167f21412b1a91f6738094e5b945d37476bbf169451
size 3664500352


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a0d7cf00be561b1678804f5f4dcd2501c06d129d617ab7e51cc911b9f313dde1
size 4661212800


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e1d3731f50966f3667d205b82037df3aed17253a92e1809f6ddf13df1c00f02d
size 5130253952


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1407d1abbd60ebef5e90adc927392c984f48fafd925c0062004373a28a4d4ab5
size 4920735360


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1407d1abbd60ebef5e90adc927392c984f48fafd925c0062004373a28a4d4ab5
size 4920735360


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3109648a81fd3adab342aec830261b7e80af89d61c2018486d2acbd9aaf3be1f
size 4692670080


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d3f6e22e54ca52237cf999fddfdeb721549455fed2704971b507c36e6dc6e1eb
size 5599295104


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:88581470a308f1ed72b531387b4044c39993590b6c09d41a6faa1e4e0bacdd44
size 6068336256


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2f2a2647188ec0a1d49dccd125a57175361d761b29c820ac815788b99584dd8c
size 5732988544


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2f2a2647188ec0a1d49dccd125a57175361d761b29c820ac815788b99584dd8c
size 5732988544


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f0e4c0a3ce2f84d37faa506d867432e79f4ccd2d1a3c6f97a69a5e75c9c7523c
size 5599295104


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fb22e10d7b8ce1d59e2adfd89c5026cb12dbc754c8c6e2999126e5026c77745a
size 6596007552


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ce0892784cc628b7eed06c9daa47cbae80453dd0caa522d15fa3fb79fbf68f8c
size 8540771968
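
Each three-line entry above is a Git LFS pointer: a spec version, a SHA-256 object ID, and the size of the real file in bytes. The sketch below parses one such pointer and converts the size; `parse_lfs_pointer` is a hypothetical helper, and it assumes the simple `key value` layout seen in these entries.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer of the three-line form shown above."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# The last pointer listed in this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:ce0892784cc628b7eed06c9daa47cbae80453dd0caa522d15fa3fb79fbf68f8c
size 8540771968
"""

info = parse_lfs_pointer(pointer)
size_gib = info["size"] / 1024**3
print(info["hash_algo"], round(size_gib, 2))  # sha256 7.95
```

The result matches the 7.95GB listed for Q8_0 in the README table, which suggests the "GB" figures there are computed with 1024^3 bytes per unit.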