Initialize project; model provided by the ModelHub XC community

Model: bartowski/ReWiz-Qwen-2.5-14B-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-17 20:38:14 +08:00
commit 01abd17a4e
30 changed files with 278 additions and 0 deletions

62
.gitattributes vendored Normal file
View File

@@ -0,0 +1,62 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q6_K_L.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q5_K_L.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q4_K_L.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q4_0_8_8.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q4_0_4_8.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q3_K_XL.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q2_K_L.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B-f16.gguf filter=lfs diff=lfs merge=lfs -text
ReWiz-Qwen-2.5-14B.imatrix filter=lfs diff=lfs merge=lfs -text

134
README.md Normal file
View File

@@ -0,0 +1,134 @@
---
quantized_by: bartowski
pipeline_tag: text-generation
tags:
- text-generation-inference
- transformers
- unsloth
- qwen2
- trl
- sft
- theprint
- rewiz
language:
- en
license: apache-2.0
base_model: theprint/ReWiz-Qwen-2.5-14B
datasets:
- theprint/ReWiz
---
## Llamacpp imatrix Quantizations of ReWiz-Qwen-2.5-14B
Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b4014">b4014</a> for quantization.
Original model: https://huggingface.co/theprint/ReWiz-Qwen-2.5-14B
All quants were made using the imatrix option with the dataset from [here](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
Run them in [LM Studio](https://lmstudio.ai/)
## Prompt format
No prompt format found; check the original model page.
## Download a file (not the whole branch) from below:
| Filename | Quant type | File Size | Split | Description |
| -------- | ---------- | --------- | ----- | ----------- |
| [ReWiz-Qwen-2.5-14B-f16.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-f16.gguf) | f16 | 29.55GB | false | Full F16 weights. |
| [ReWiz-Qwen-2.5-14B-Q8_0.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q8_0.gguf) | Q8_0 | 15.70GB | false | Extremely high quality, generally unneeded but max available quant. |
| [ReWiz-Qwen-2.5-14B-Q6_K_L.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q6_K_L.gguf) | Q6_K_L | 12.50GB | false | Uses Q8_0 for embed and output weights. Very high quality, near perfect, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q6_K.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q6_K.gguf) | Q6_K | 12.12GB | false | Very high quality, near perfect, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q5_K_L.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q5_K_L.gguf) | Q5_K_L | 10.99GB | false | Uses Q8_0 for embed and output weights. High quality, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q5_K_M.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q5_K_M.gguf) | Q5_K_M | 10.51GB | false | High quality, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q5_K_S.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q5_K_S.gguf) | Q5_K_S | 10.27GB | false | High quality, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q4_K_L.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q4_K_L.gguf) | Q4_K_L | 9.57GB | false | Uses Q8_0 for embed and output weights. Good quality, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q4_K_M.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q4_K_M.gguf) | Q4_K_M | 8.99GB | false | Good quality, default size for most use cases, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q3_K_XL.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q3_K_XL.gguf) | Q3_K_XL | 8.61GB | false | Uses Q8_0 for embed and output weights. Lower quality but usable, good for low RAM availability. |
| [ReWiz-Qwen-2.5-14B-Q4_K_S.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q4_K_S.gguf) | Q4_K_S | 8.57GB | false | Slightly lower quality with more space savings, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q4_0.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q4_0.gguf) | Q4_0 | 8.54GB | false | Legacy format, generally not worth using over similarly sized formats. |
| [ReWiz-Qwen-2.5-14B-Q4_0_8_8.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q4_0_8_8.gguf) | Q4_0_8_8 | 8.52GB | false | Optimized for ARM inference. Requires 'sve' support (see link below). *Don't use on Mac or Windows*. |
| [ReWiz-Qwen-2.5-14B-Q4_0_4_8.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q4_0_4_8.gguf) | Q4_0_4_8 | 8.52GB | false | Optimized for ARM inference. Requires 'i8mm' support (see link below). *Don't use on Mac or Windows*. |
| [ReWiz-Qwen-2.5-14B-Q4_0_4_4.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q4_0_4_4.gguf) | Q4_0_4_4 | 8.52GB | false | Optimized for ARM inference. Should work well on all ARM chips, pick this if you're unsure. *Don't use on Mac or Windows*. |
| [ReWiz-Qwen-2.5-14B-IQ4_XS.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-IQ4_XS.gguf) | IQ4_XS | 8.12GB | false | Decent quality, smaller than Q4_K_S with similar performance, *recommended*. |
| [ReWiz-Qwen-2.5-14B-Q3_K_L.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q3_K_L.gguf) | Q3_K_L | 7.92GB | false | Lower quality but usable, good for low RAM availability. |
| [ReWiz-Qwen-2.5-14B-Q3_K_M.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q3_K_M.gguf) | Q3_K_M | 7.34GB | false | Low quality. |
| [ReWiz-Qwen-2.5-14B-IQ3_M.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-IQ3_M.gguf) | IQ3_M | 6.92GB | false | Medium-low quality, new method with decent performance comparable to Q3_K_M. |
| [ReWiz-Qwen-2.5-14B-Q3_K_S.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q3_K_S.gguf) | Q3_K_S | 6.66GB | false | Low quality, not recommended. |
| [ReWiz-Qwen-2.5-14B-Q2_K_L.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q2_K_L.gguf) | Q2_K_L | 6.53GB | false | Uses Q8_0 for embed and output weights. Very low quality but surprisingly usable. |
| [ReWiz-Qwen-2.5-14B-IQ3_XS.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-IQ3_XS.gguf) | IQ3_XS | 6.38GB | false | Lower quality, new method with decent performance, slightly better than Q3_K_S. |
| [ReWiz-Qwen-2.5-14B-Q2_K.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-Q2_K.gguf) | Q2_K | 5.77GB | false | Very low quality but surprisingly usable. |
| [ReWiz-Qwen-2.5-14B-IQ2_M.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-IQ2_M.gguf) | IQ2_M | 5.36GB | false | Relatively low quality, uses SOTA techniques to be surprisingly usable. |
| [ReWiz-Qwen-2.5-14B-IQ2_S.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-IQ2_S.gguf) | IQ2_S | 5.00GB | false | Low quality, uses SOTA techniques to be usable. |
| [ReWiz-Qwen-2.5-14B-IQ2_XS.gguf](https://huggingface.co/bartowski/ReWiz-Qwen-2.5-14B-GGUF/blob/main/ReWiz-Qwen-2.5-14B-IQ2_XS.gguf) | IQ2_XS | 4.70GB | false | Low quality, uses SOTA techniques to be usable. |
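
The links in the table point at `/blob/` pages, which are the web view of each file. As a minimal sketch, assuming the usual Hugging Face convention that the same path under `/resolve/` serves the raw file, the direct-download URL can be derived like this (the helper names `resolve_url` and `blob_to_resolve` are hypothetical, introduced only for illustration):

```python
REPO_ID = "bartowski/ReWiz-Qwen-2.5-14B-GGUF"


def resolve_url(filename: str, repo_id: str = REPO_ID, revision: str = "main") -> str:
    """Build the direct-download URL for one file in a Hugging Face repo."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


def blob_to_resolve(blob_url: str) -> str:
    """Turn a table link (/blob/) into its direct-download form (/resolve/)."""
    # Only the first occurrence is replaced, in case a file name contains "/blob/".
    return blob_url.replace("/blob/", "/resolve/", 1)


if __name__ == "__main__":
    print(resolve_url("ReWiz-Qwen-2.5-14B-Q4_K_M.gguf"))
```

The resulting URL can be handed to any downloader that follows redirects; a mirror of this repository may use a different URL layout.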
## Embed/output weights
Some of these quants (Q3_K_XL, Q4_K_L etc) are the standard quantization method with the embeddings and output weights quantized to Q8_0 instead of what they would normally default to.
Some say that this improves the quality, others don't notice any difference. If you use these models PLEASE COMMENT with your findings. I would like feedback that these are actually used and useful so I don't keep uploading quants no one is using.
Thanks!
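
To put a number on the trade-off, the sketch below compares each Q8_0-embed/output quant with a standard quant of the same family, using the file sizes from the table above. Which standard quant each `_L`/`_XL` file is derived from is an assumption made here for illustration; the table does not state the pairing.

```python
# (variant with Q8_0 embed/output, assumed standard counterpart): sizes in GB from the table
PAIRS = {
    ("Q6_K_L", "Q6_K"): (12.50, 12.12),
    ("Q5_K_L", "Q5_K_M"): (10.99, 10.51),
    ("Q4_K_L", "Q4_K_M"): (9.57, 8.99),
    ("Q3_K_XL", "Q3_K_L"): (8.61, 7.92),
    ("Q2_K_L", "Q2_K"): (6.53, 5.77),
}


def extra_size_gb(pairs=PAIRS) -> dict:
    """Return the extra file size (GB) of each Q8_0-embed/output variant."""
    return {
        variant: round(big - small, 2)
        for (variant, _base), (big, small) in pairs.items()
    }


if __name__ == "__main__":
    for variant, extra in extra_size_gb().items():
        print(f"{variant}: +{extra:.2f} GB")
```

Under that pairing, the extra cost ranges from roughly 0.4GB to 0.8GB per file.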
## Downloading using huggingface-cli
First, make sure you have huggingface-cli installed:
```
pip install -U "huggingface_hub[cli]"
```
Then, you can target the specific file you want:
```
huggingface-cli download bartowski/ReWiz-Qwen-2.5-14B-GGUF --include "ReWiz-Qwen-2.5-14B-Q4_K_M.gguf" --local-dir ./
```
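If the model is bigger than 50GB, it will have been split into multiple files. None of the files in the table above are split, so the Q8_0 pattern below only illustrates the syntax. To download all parts of a split quant to a local folder, run: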
```
huggingface-cli download bartowski/ReWiz-Qwen-2.5-14B-GGUF --include "ReWiz-Qwen-2.5-14B-Q8_0/*" --local-dir ./
```
You can either specify a new local-dir (ReWiz-Qwen-2.5-14B-Q8_0) or download them all in place (./).
## Q4_0_X_X
These are *NOT* for Metal (Apple) offloading, only ARM chips.
If you're using an ARM chip, the Q4_0_X_X quants will have a substantial speedup. Check out Q4_0_4_4 speed comparisons [on the original pull request](https://github.com/ggerganov/llama.cpp/pull/5780#pullrequestreview-21657544660).
To check which one would work best for your ARM chip, you can check [AArch64 SoC features](https://gpages.juszkiewicz.com.pl/arm-socs-table/arm-socs.html) (thanks EloyOn!).
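
As a rough sketch of that check on Linux, the feature flags can be read from the `Features` line of `/proc/cpuinfo` and mapped to the requirements listed in the table above. The preference order (try `sve` first, then `i8mm`) is an assumption, and the helper names are hypothetical; a quant may need more than the single flag tested here, so treat the result as a starting point.

```python
def read_cpu_features(path: str = "/proc/cpuinfo") -> set[str]:
    """Collect CPU feature flags from a Linux-style cpuinfo file."""
    features: set[str] = set()
    try:
        with open(path, encoding="utf-8") as fh:
            for line in fh:
                if line.lower().startswith("features"):
                    # Lines look like "Features : fp asimd ... i8mm"
                    _, _, value = line.partition(":")
                    features.update(value.split())
    except OSError:
        pass  # file missing (non-Linux system): report no features
    return features


def pick_arm_quant(features: set[str]) -> str:
    """Map feature flags to the Q4_0_X_X variant named in the table."""
    if "sve" in features:
        return "Q4_0_8_8"  # table: requires 'sve'
    if "i8mm" in features:
        return "Q4_0_4_8"  # table: requires 'i8mm'
    return "Q4_0_4_4"      # table: should work on all ARM chips


if __name__ == "__main__":
    print(pick_arm_quant(read_cpu_features()))
```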
## Which file should I choose?
A great write-up with charts showing various performances is provided by Artefact2 [here](https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9).
The first thing to figure out is how big a model you can run. To do this, you'll need to figure out how much RAM and/or VRAM you have.
If you want your model running as FAST as possible, you'll want to fit the whole thing on your GPU's VRAM. Aim for a quant with a file size 1-2GB smaller than your GPU's total VRAM.
If you want the absolute maximum quality, add both your system RAM and your GPU's VRAM together, then similarly grab a quant with a file size 1-2GB smaller than that total.
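
The sizing rule above can be sketched as a small lookup over the file sizes from the table. The function name and the default 2GB headroom are illustrative choices, and the ARM-only Q4_0_X_X files are left out. Note that the largest file is not always the best-quality one (Q3_K_XL is slightly larger than Q4_K_S), so use the result as a shortlist, not a verdict.

```python
# File sizes in GB, copied from the table above (ARM-only Q4_0_X_X files omitted).
QUANT_SIZES_GB = {
    "f16": 29.55, "Q8_0": 15.70, "Q6_K_L": 12.50, "Q6_K": 12.12,
    "Q5_K_L": 10.99, "Q5_K_M": 10.51, "Q5_K_S": 10.27, "Q4_K_L": 9.57,
    "Q4_K_M": 8.99, "Q3_K_XL": 8.61, "Q4_K_S": 8.57, "Q4_0": 8.54,
    "IQ4_XS": 8.12, "Q3_K_L": 7.92, "Q3_K_M": 7.34, "IQ3_M": 6.92,
    "Q3_K_S": 6.66, "Q2_K_L": 6.53, "IQ3_XS": 6.38, "Q2_K": 5.77,
    "IQ2_M": 5.36, "IQ2_S": 5.00, "IQ2_XS": 4.70,
}


def largest_fitting_quant(memory_gb, headroom_gb=2.0, sizes=QUANT_SIZES_GB):
    """Return the largest quant whose file fits in memory_gb minus headroom_gb.

    Pass VRAM alone for the 'as fast as possible' case, or RAM + VRAM for the
    'maximum quality' case. Returns None when nothing fits.
    """
    budget = memory_gb - headroom_gb
    fitting = {name: size for name, size in sizes.items() if size <= budget}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    print(largest_fitting_quant(12))      # 12GB of VRAM only
    print(largest_fitting_quant(16 + 8))  # 16GB RAM + 8GB VRAM
```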
Next, you'll need to decide if you want to use an 'I-quant' or a 'K-quant'.
If you don't want to think too much, grab one of the K-quants. These are in format 'QX_K_X', like Q5_K_M.
If you want to get more into the weeds, you can check out this extremely useful feature chart:
[llama.cpp feature matrix](https://github.com/ggerganov/llama.cpp/wiki/Feature-matrix)
But basically, if you're aiming for below Q4, and you're running cuBLAS (Nvidia) or rocBLAS (AMD), you should look towards the I-quants. These are in format IQX_X, like IQ3_M. These are newer and offer better performance for their size.
These I-quants can also be used on CPU and Apple Metal, but will be slower than their K-quant equivalent, so speed vs performance is a tradeoff you'll have to decide.
The I-quants are *not* compatible with Vulkan, which is also used on AMD cards, so if you have an AMD card double check if you're using the rocBLAS build or the Vulkan build. At the time of writing this, LM Studio has a preview with ROCm support, and other inference engines have specific builds for ROCm.
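
The guidance in this section can be condensed into a small decision helper. This is only a sketch of the rules of thumb stated above; the backend labels and the function name are hypothetical, and for CPU or Metal it falls back to K-quants because the text describes I-quants there as usable but slower.

```python
def suggest_quant_family(bits: int, backend: str) -> str:
    """Suggest 'I-quant' or 'K-quant' from target bit width and backend.

    bits    -- the X in QX / IQX, e.g. 3 for Q3_K_M or IQ3_M
    backend -- one of 'cublas', 'rocblas', 'vulkan', 'metal', 'cpu'
    """
    backend = backend.lower()
    if backend == "vulkan":
        # Stated above: I-quants are not compatible with the Vulkan build.
        return "K-quant"
    if bits < 4 and backend in {"cublas", "rocblas"}:
        # Below Q4 on Nvidia/AMD GPU builds, I-quants offer more for their size.
        return "I-quant"
    # Default: K-quants, the 'don't think too much' choice.
    return "K-quant"


if __name__ == "__main__":
    print(suggest_quant_family(3, "cublas"))
```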
## Credits
Thank you kalomaze and Dampf for assistance in creating the imatrix calibration dataset.
Thank you ZeroWw for the inspiration to experiment with embed/output.
Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e65136b965671f30196d8b55d32e13727276289cffa7b3f940bb317d8feb1bf4
size 5356144320

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3e60234d86dad34f488d5a0d4ceb1a16f5fe96cb3c2cad8bdd5f554aa5bf54d6
size 5003724480

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:45bf0c520b3285d057055093bbd59dd29c9cd4fd19b29ad4b9740c1d5423686d
size 4704573120

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ff3001452878d9c0c09052a432f4e85dc003859f03aadbbd2ee280f31cd340b9
size 6916536000

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7975848605b9221b13e50b655d6ffbde7591a6153f6176cdca0f6d44610bad35
size 6383359680

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b99c30cd27a0c2786e3ddcd8a57fd3d89138d627c7367b830b2df8c4689ef32f
size 8119838400

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:be47f0e496dbcc9333eec730582e50e770f35fe58ff27c3c19b2ac1f8237d404
size 5770495680

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:03574171a442c7f861d9ab7a32139b75f5a51bb100a868a16f014821e74d2723
size 6530815680

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8c4771c4a87c1cacbecd25b38dbf50ba0c36edce2db86b693581ec505b1d92bb
size 7924766400

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9e940c0edf6496abf09dd9e7c1e42a535b3768044e790a6c3d9ae76cbec598cd
size 7339202240

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:40a989fcb19c79f1ed9c0a986f1ed02533f1f5d094145699f6f2dd3322c895cc
size 6659593920

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ee8d8030a8d2043c4beabe8281df93b78041959be3c3e9274803ce86efe61d07
size 8606013120

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f413333d295892a74bcbf3f9ae0091477a11ebc626d1ee110b551c97200177d5
size 8544265920

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:17c80d57173ec46ab84da6d15d208238bb5115c73b677e648856a891e84e9a09
size 8517723840

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e78f3b7d28af01a20e1e534048e335a37a52218f7b89f10fc4321eaeb81e9f30
size 8517723840

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d827184cd2ff22c8653c20512432141a6365020b8de88aac10513fc9ba18254b
size 8517723840

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a2564b04169b816c7ad0ce2ee2bae0be94dfe0c111d0a46b6b273508420a5556
size 9565951680

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2b01e4cb98dba44fd07507a0c560d77555e7bb27f70f47556edd2141a94cde21
size 8988108480

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bc0ad985d6fc88d442598236620991a6e7f9a9d5c0bd21993b545d106a9af890
size 8573429440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ada535cbf3d30d02f1992d9ef8e2cac4156882da84708692e0d2c66c92cf84cc
size 10989393600

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:34ace04dfeaa3b864d2838a780da21c0ae8109d7222e1675aa213aed3c73ea67
size 10508871360

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2309bc703bc4594285a74a2b4c529da12d1ade401cb197a52386bc848fdf9de9
size 10266552000

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:98305171ebed3d8274305afa9c382c27c96110cc78c79de83f377af24f096ab7
size 12124681920

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6ab1958c13354ce1ffbef25ca42fdb3ddd5a1ef80382f27ce1af423597d463c3
size 12501800640

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4408f204c85a19bc18c13c97ba59701987f6444b640ed3d953505b3e9800a9a8
size 15701595840

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:96dc2dd2d6ee05350c9bc2e80124b5b895cdb7b9ad2b4b1991d495ba312a458e
size 29547713984

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e4d1c3b2e76dbb2a2c906703ddcd31a138902bc2f9663004fe719892a64600ce
size 8563610
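
Each three-line entry above is a Git LFS pointer: a spec version, a SHA-256 object id, and the size in bytes of the real file. As a hedged sketch (the helper names are hypothetical and not part of any Git LFS tooling), a downloaded file can be checked against its pointer like this:

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")  # e.g. "sha256:<hex>"
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at path has the pointer's size and digest."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first, avoids hashing multi-GB files
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

Comparing the size first is a deliberate shortcut: a truncated download is rejected without reading the whole file.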

1
configuration.json Normal file
View File

@@ -0,0 +1 @@
{"framework": "pytorch", "task": "text-generation", "allow_remote": true}