初始化项目,由ModelHub XC社区提供模型
Model: simustar/Yi-34B-Llama-GGUF Source: Original Platform
This commit is contained in:
45
.gitattributes
vendored
Normal file
45
.gitattributes
vendored
Normal file
@@ -0,0 +1,45 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Yi-34B-Llama_Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
40
README.md
Normal file
40
README.md
Normal file
@@ -0,0 +1,40 @@
|
||||
---
|
||||
pipeline_tag: text-generation
|
||||
language:
|
||||
- en
|
||||
- zh
|
||||
tags:
|
||||
- llama
|
||||
---
|
||||
|
||||
**This model repository contains files in GGUF format for the Yi 34B LLaMA, compatible with LLaMA modeling, based on the work from the [chargoddard/Yi-34B-Llama](https://huggingface.co/chargoddard/Yi-34B-Llama) repository.**
|
||||
|
||||
Based of the work of chargoddard's:
|
||||
|
||||
- Tensors have been renamed to match the standard LLaMA.
|
||||
- Model can be loaded without trust_remote_code, but the tokenizer can not.
|
||||
|
||||
**Converted & Quantized Files**
|
||||
|
||||
### Yi-34B-Llamafied Model Options
|
||||
|
||||
The following tables list the available Yi-34B-Llamafied model files with their respective quantization methods and characteristics.
|
||||
|
||||
**Key:**
|
||||
- **Size**: File size relative to the original.
|
||||
- **Quality Loss**: The amount of quality loss due to quantization.
|
||||
|
||||
| Q-Method | File Name | Size | Quality Loss | Recommended |
|
||||
|----------|---------------------|--------|--------------------------------------|----------------------|
|
||||
| Q2 | Yi-34B-Llama_Q2_K | Smallest | Extreme *(not recommended)* | |
|
||||
| Q3 | Yi-34B-Llama_Q3_K_S | Very Small | Very High | |
|
||||
| Q3 | Yi-34B-Llama_Q3_K_M | Very Small | Very High | |
|
||||
| Q3 | Yi-34B-Llama_Q3_K_L | Small | Substantial | |
|
||||
| Q4 | Yi-34B-Llama_Q4_K_S | Small | Significant | |
|
||||
| Q4 | Yi-34B-Llama_Q4_K_M | Medium | Balanced | **Recommended** |
|
||||
| Q5 | Yi-34B-Llama_Q5_K_S | Large | Low | **Recommended** |
|
||||
| Q5 | Yi-34B-Llama_Q5_K_M | Large | Very Low | **Recommended** |
|
||||
| Q6 | Yi-34B-Llama_Q6_K | Very Large | Extremely Low | |
|
||||
| Q8 | Yi-34B-Llama_Q8_0 | Very Large | Extremely Low *(not recommended)* | |
|
||||
|
||||
Please choose the model that best suits your needs based on the size and quality loss trade-offs.
|
||||
3
Yi-34B-Llama_Q2_K.gguf
Normal file
3
Yi-34B-Llama_Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a243a59019812b481fe5e35dcdc9f18bea427665360075ce5997e8c4a5fc886f
|
||||
size 14555875264
|
||||
3
Yi-34B-Llama_Q3_K_L.gguf
Normal file
3
Yi-34B-Llama_Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:6fa7adb9983d3d01a75397a35aa1fa576f45c136c0f0e04dbb5a384e42d832c9
|
||||
size 18139445184
|
||||
3
Yi-34B-Llama_Q3_K_M.gguf
Normal file
3
Yi-34B-Llama_Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:dc06de9bc5a1a9acc114125bbea9bd492a973830d65b4356b1599708fb503b9e
|
||||
size 16636573632
|
||||
3
Yi-34B-Llama_Q3_K_S.gguf
Normal file
3
Yi-34B-Llama_Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:65497cb4bc8f580aa1ae2bad497822c27855ef127753561ae802342618b3376b
|
||||
size 14960293824
|
||||
3
Yi-34B-Llama_Q4_K_M.gguf
Normal file
3
Yi-34B-Llama_Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c4e8dba15ebd37caa4c735364be0104278aa507e2f1ed8f049dd1a9a13f65c49
|
||||
size 20658710464
|
||||
3
Yi-34B-Llama_Q4_K_S.gguf
Normal file
3
Yi-34B-Llama_Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:428574d0b79f266f3806ded29b964e7e826d508f6b4da4d6c98f8fc83bda0cd8
|
||||
size 19543599040
|
||||
3
Yi-34B-Llama_Q5_K_M.gguf
Normal file
3
Yi-34B-Llama_Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:805ad30d294d83944e91f9856c92c74d522fb6a901012f9e44b48fece08a0204
|
||||
size 24321845184
|
||||
3
Yi-34B-Llama_Q5_K_S.gguf
Normal file
3
Yi-34B-Llama_Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d388cc707bb3f27f0bee1df902fa05f706ac018439c4810719276cf5288177ea
|
||||
size 23707690944
|
||||
3
Yi-34B-Llama_Q6_K.gguf
Normal file
3
Yi-34B-Llama_Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e85f7ebb8da657201ecacaf04c7740ab7db1c79c0a41091b73874c13ad5a2d73
|
||||
size 28213925824
|
||||
3
Yi-34B-Llama_Q8_0.gguf
Normal file
3
Yi-34B-Llama_Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e3084d716947ba6be37f5bd7ecfd5d0bd92b2b1c988fc0b1b5fe5418f9954551
|
||||
size 36542281600
|
||||
Reference in New Issue
Block a user