初始化项目,由ModelHub XC社区提供模型

Model: bartowski/arcee-ai_Virtuoso-Large-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-11 13:30:33 +08:00
commit d3c4047312
33 changed files with 314 additions and 0 deletions

49
.gitattributes vendored Normal file
View File

@@ -0,0 +1,49 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bin.* filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zstandard filter=lfs diff=lfs merge=lfs -text
*.tfevents* filter=lfs diff=lfs merge=lfs -text
*.db* filter=lfs diff=lfs merge=lfs -text
*.ark* filter=lfs diff=lfs merge=lfs -text
**/*ckpt*data* filter=lfs diff=lfs merge=lfs -text
**/*ckpt*.meta filter=lfs diff=lfs merge=lfs -text
**/*ckpt*.index filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.gguf* filter=lfs diff=lfs merge=lfs -text
*.ggml filter=lfs diff=lfs merge=lfs -text
*.llamafile* filter=lfs diff=lfs merge=lfs -text
*.pt2 filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
arcee-ai_Virtuoso-Large.imatrix filter=lfs diff=lfs merge=lfs -text

174
README.md Normal file
View File

@@ -0,0 +1,174 @@
---
quantized_by: bartowski
pipeline_tag: text-generation
license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
license_name: qwen
base_model: arcee-ai/Virtuoso-Large
license: other
base_model_relation: quantized
---
## Llamacpp imatrix Quantizations of Virtuoso-Large by arcee-ai
Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b5697">b5697</a> for quantization.
Original model: https://huggingface.co/arcee-ai/Virtuoso-Large
All quants made using imatrix option with dataset from [here](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)
Run them in [LM Studio](https://lmstudio.ai/)
Run them directly with [llama.cpp](https://github.com/ggerganov/llama.cpp), or any other llama.cpp based project
## Prompt format
```
<|im_start|>system
{system_prompt}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant
```
## Download a file (not the whole branch) from below:
| Filename | Quant type | File Size | Split | Description |
| -------- | ---------- | --------- | ----- | ----------- |
| [Virtuoso-Large-Q8_0.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/tree/main/arcee-ai_Virtuoso-Large-Q8_0) | Q8_0 | 77.26GB | true | Extremely high quality, generally unneeded but max available quant. |
| [Virtuoso-Large-Q6_K.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/tree/main/arcee-ai_Virtuoso-Large-Q6_K) | Q6_K | 64.35GB | true | Very high quality, near perfect, *recommended*. |
| [Virtuoso-Large-Q5_K_M.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/tree/main/arcee-ai_Virtuoso-Large-Q5_K_M) | Q5_K_M | 54.45GB | true | High quality, *recommended*. |
| [Virtuoso-Large-Q5_K_S.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/tree/main/arcee-ai_Virtuoso-Large-Q5_K_S) | Q5_K_S | 51.38GB | true | High quality, *recommended*. |
| [Virtuoso-Large-Q4_K_L.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q4_K_L.gguf) | Q4_K_L | 48.34GB | false | Uses Q8_0 for embed and output weights. Good quality, *recommended*. |
| [Virtuoso-Large-Q4_K_M.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q4_K_M.gguf) | Q4_K_M | 47.42GB | false | Good quality, default size for most use cases, *recommended*. |
| [Virtuoso-Large-Q4_1.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q4_1.gguf) | Q4_1 | 45.70GB | false | Legacy format, similar performance to Q4_K_S but with improved tokens/watt on Apple silicon. |
| [Virtuoso-Large-Q4_K_S.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q4_K_S.gguf) | Q4_K_S | 43.89GB | false | Slightly lower quality with more space savings, *recommended*. |
| [Virtuoso-Large-Q4_0.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q4_0.gguf) | Q4_0 | 41.38GB | false | Legacy format, offers online repacking for ARM and AVX CPU inference. |
| [Virtuoso-Large-IQ4_NL.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ4_NL.gguf) | IQ4_NL | 41.32GB | false | Similar to IQ4_XS, but slightly larger. Offers online repacking for ARM CPU inference. |
| [Virtuoso-Large-Q3_K_XL.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q3_K_XL.gguf) | Q3_K_XL | 40.60GB | false | Uses Q8_0 for embed and output weights. Lower quality but usable, good for low RAM availability. |
| [Virtuoso-Large-IQ4_XS.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ4_XS.gguf) | IQ4_XS | 39.71GB | false | Decent quality, smaller than Q4_K_S with similar performance, *recommended*. |
| [Virtuoso-Large-Q3_K_L.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q3_K_L.gguf) | Q3_K_L | 39.51GB | false | Lower quality but usable, good for low RAM availability. |
| [Virtuoso-Large-Q3_K_M.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q3_K_M.gguf) | Q3_K_M | 37.70GB | false | Low quality. |
| [Virtuoso-Large-IQ3_M.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ3_M.gguf) | IQ3_M | 35.50GB | false | Medium-low quality, new method with decent performance comparable to Q3_K_M. |
| [Virtuoso-Large-Q3_K_S.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q3_K_S.gguf) | Q3_K_S | 34.49GB | false | Low quality, not recommended. |
| [Virtuoso-Large-IQ3_XS.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ3_XS.gguf) | IQ3_XS | 32.84GB | false | Lower quality, new method with decent performance, slightly better than Q3_K_S. |
| [Virtuoso-Large-IQ3_XXS.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ3_XXS.gguf) | IQ3_XXS | 31.85GB | false | Lower quality, new method with decent performance, comparable to Q3 quants. |
| [Virtuoso-Large-Q2_K_L.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q2_K_L.gguf) | Q2_K_L | 31.03GB | false | Uses Q8_0 for embed and output weights. Very low quality but surprisingly usable. |
| [Virtuoso-Large-Q2_K.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-Q2_K.gguf) | Q2_K | 29.81GB | false | Very low quality but surprisingly usable. |
| [Virtuoso-Large-IQ2_M.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ2_M.gguf) | IQ2_M | 29.34GB | false | Relatively low quality, uses SOTA techniques to be surprisingly usable. |
| [Virtuoso-Large-IQ2_S.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ2_S.gguf) | IQ2_S | 27.94GB | false | Low quality, uses SOTA techniques to be usable. |
| [Virtuoso-Large-IQ2_XS.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ2_XS.gguf) | IQ2_XS | 27.06GB | false | Low quality, uses SOTA techniques to be usable. |
| [Virtuoso-Large-IQ2_XXS.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ2_XXS.gguf) | IQ2_XXS | 25.49GB | false | Very low quality, uses SOTA techniques to be usable. |
| [Virtuoso-Large-IQ1_M.gguf](https://huggingface.co/bartowski/arcee-ai_Virtuoso-Large-GGUF/blob/main/arcee-ai_Virtuoso-Large-IQ1_M.gguf) | IQ1_M | 23.74GB | false | Extremely low quality, *not* recommended. |
## Embed/output weights
Some of these quants (Q3_K_XL, Q4_K_L etc) are the standard quantization method with the embeddings and output weights quantized to Q8_0 instead of what they would normally default to.
## Downloading using huggingface-cli
<details>
<summary>Click to view download instructions</summary>
First, make sure you have hugginface-cli installed:
```
pip install -U "huggingface_hub[cli]"
```
Then, you can target the specific file you want:
```
huggingface-cli download bartowski/arcee-ai_Virtuoso-Large-GGUF --include "arcee-ai_Virtuoso-Large-Q4_K_M.gguf" --local-dir ./
```
If the model is bigger than 50GB, it will have been split into multiple files. In order to download them all to a local folder, run:
```
huggingface-cli download bartowski/arcee-ai_Virtuoso-Large-GGUF --include "arcee-ai_Virtuoso-Large-Q8_0/*" --local-dir ./
```
You can either specify a new local-dir (arcee-ai_Virtuoso-Large-Q8_0) or download them all in place (./)
</details>
## ARM/AVX information
Previously, you would download Q4_0_4_4/4_8/8_8, and these would have their weights interleaved in memory in order to improve performance on ARM and AVX machines by loading up more data in one pass.
Now, however, there is something called "online repacking" for weights. details in [this PR](https://github.com/ggerganov/llama.cpp/pull/9921). If you use Q4_0 and your hardware would benefit from repacking weights, it will do it automatically on the fly.
As of llama.cpp build [b4282](https://github.com/ggerganov/llama.cpp/releases/tag/b4282) you will not be able to run the Q4_0_X_X files and will instead need to use Q4_0.
Additionally, if you want to get slightly better quality for , you can use IQ4_NL thanks to [this PR](https://github.com/ggerganov/llama.cpp/pull/10541) which will also repack the weights for ARM, though only the 4_4 for now. The loading time may be slower but it will result in an overall speed incrase.
<details>
<summary>Click to view Q4_0_X_X information (deprecated</summary>
I'm keeping this section to show the potential theoretical uplift in performance from using the Q4_0 with online repacking.
<details>
<summary>Click to view benchmarks on an AVX2 system (EPYC7702)</summary>
| model | size | params | backend | threads | test | t/s | % (vs Q4_0) |
| ------------------------------ | ---------: | ---------: | ---------- | ------: | ------------: | -------------------: |-------------: |
| qwen2 3B Q4_0 | 1.70 GiB | 3.09 B | CPU | 64 | pp512 | 204.03 ± 1.03 | 100% |
| qwen2 3B Q4_0 | 1.70 GiB | 3.09 B | CPU | 64 | pp1024 | 282.92 ± 0.19 | 100% |
| qwen2 3B Q4_0 | 1.70 GiB | 3.09 B | CPU | 64 | pp2048 | 259.49 ± 0.44 | 100% |
| qwen2 3B Q4_0 | 1.70 GiB | 3.09 B | CPU | 64 | tg128 | 39.12 ± 0.27 | 100% |
| qwen2 3B Q4_0 | 1.70 GiB | 3.09 B | CPU | 64 | tg256 | 39.31 ± 0.69 | 100% |
| qwen2 3B Q4_0 | 1.70 GiB | 3.09 B | CPU | 64 | tg512 | 40.52 ± 0.03 | 100% |
| qwen2 3B Q4_K_M | 1.79 GiB | 3.09 B | CPU | 64 | pp512 | 301.02 ± 1.74 | 147% |
| qwen2 3B Q4_K_M | 1.79 GiB | 3.09 B | CPU | 64 | pp1024 | 287.23 ± 0.20 | 101% |
| qwen2 3B Q4_K_M | 1.79 GiB | 3.09 B | CPU | 64 | pp2048 | 262.77 ± 1.81 | 101% |
| qwen2 3B Q4_K_M | 1.79 GiB | 3.09 B | CPU | 64 | tg128 | 18.80 ± 0.99 | 48% |
| qwen2 3B Q4_K_M | 1.79 GiB | 3.09 B | CPU | 64 | tg256 | 24.46 ± 3.04 | 83% |
| qwen2 3B Q4_K_M | 1.79 GiB | 3.09 B | CPU | 64 | tg512 | 36.32 ± 3.59 | 90% |
| qwen2 3B Q4_0_8_8 | 1.69 GiB | 3.09 B | CPU | 64 | pp512 | 271.71 ± 3.53 | 133% |
| qwen2 3B Q4_0_8_8 | 1.69 GiB | 3.09 B | CPU | 64 | pp1024 | 279.86 ± 45.63 | 100% |
| qwen2 3B Q4_0_8_8 | 1.69 GiB | 3.09 B | CPU | 64 | pp2048 | 320.77 ± 5.00 | 124% |
| qwen2 3B Q4_0_8_8 | 1.69 GiB | 3.09 B | CPU | 64 | tg128 | 43.51 ± 0.05 | 111% |
| qwen2 3B Q4_0_8_8 | 1.69 GiB | 3.09 B | CPU | 64 | tg256 | 43.35 ± 0.09 | 110% |
| qwen2 3B Q4_0_8_8 | 1.69 GiB | 3.09 B | CPU | 64 | tg512 | 42.60 ± 0.31 | 105% |
Q4_0_8_8 offers a nice bump to prompt processing and a small bump to text generation
</details>
</details>
## Which file should I choose?
<details>
<summary>Click here for details</summary>
A great write up with charts showing various performances is provided by Artefact2 [here](https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9)
The first thing to figure out is how big a model you can run. To do this, you'll need to figure out how much RAM and/or VRAM you have.
If you want your model running as FAST as possible, you'll want to fit the whole thing on your GPU's VRAM. Aim for a quant with a file size 1-2GB smaller than your GPU's total VRAM.
If you want the absolute maximum quality, add both your system RAM and your GPU's VRAM together, then similarly grab a quant with a file size 1-2GB Smaller than that total.
Next, you'll need to decide if you want to use an 'I-quant' or a 'K-quant'.
If you don't want to think too much, grab one of the K-quants. These are in format 'QX_K_X', like Q5_K_M.
If you want to get more into the weeds, you can check out this extremely useful feature chart:
[llama.cpp feature matrix](https://github.com/ggerganov/llama.cpp/wiki/Feature-matrix)
But basically, if you're aiming for below Q4, and you're running cuBLAS (Nvidia) or rocBLAS (AMD), you should look towards the I-quants. These are in format IQX_X, like IQ3_M. These are newer and offer better performance for their size.
These I-quants can also be used on CPU, but will be slower than their K-quant equivalent, so speed vs performance is a tradeoff you'll have to decide.
</details>
## Credits
Thank you kalomaze and Dampf for assistance in creating the imatrix calibration dataset.
Thank you ZeroWw for the inspiration to experiment with embed/output.
Thank you to LM Studio for sponsoring my work.
Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:64b987900a988c03c52f91ce264a649c0b2ae1196d202c3a018a133162887074
size 23740212544

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d407afef24860231a20fc46bd4695b48355639478a13716065c2fd04c16a81f6
size 29338985792

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:46e1ea30836479ae18d6e666476a5413d8ea7b2788664a914c0903f4a2bf13ed
size 27939136832

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:45357b27119df9e15a88953ac0d3dcead8c03f9c81f0301c4f5a25cccf49a854
size 27057644864

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5cfcfe77a2c9b29327a6dbe860eb0391f84110d4a7f7dd77b05d191d254fd3dd
size 25490023744

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d65a2a2537c80c32cf9d309d6cdaaf7951356158a81b158e469797d651498069
size 35503596864

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f7b9714f7f6978fcc1c4be6d6f78d86f1e8be8285aedcc2170d0ffeff9268dab
size 32842179904

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ab8f09426570ab83f7020ed6a39f37fa24662673ff507ba368d30a438863d27c
size 31845082432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:61afca63ef57140d3c1d40777dee241c59fb6e2c6977443d44a1fc54677edee4
size 41315624256

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d1aee3ff0179934acc9a56589cdb3eb1b7df0aa303187ff0be6169a4d9cd928f
size 39709074752

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:47badcaa8b9452bbf16e9d578692304d57b90eb483208397234cfd49a16306ce
size 29811762496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:194ff1bd1ab932df4dc3bd061a036b790c7a0ad923b52b881e908e45b0a9bdaf
size 31028274496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:564711965cc35b27defabcf671dbb433d1bc2cf4843f5a7e0928fc4de154521d
size 39505225024

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3dad7fdb8d4d8f31efdaef041768eb88f2eb5289030c334bf338de862b4550ae
size 37698725184

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b3e83fcfb553472de430cd4f23dae071ac4a6e532979b17ae13d69def4d4a4ae
size 34487788864

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:76d9f2f891b791457de013a32a4613e61d52929d3fe68c9f2fa606e67029dfe8
size 40595219776

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4e2efa731fd864da15dccf5473eb070f20202604ad73c6df8e2cd4875086705d
size 41383126336

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2df6817c31fa6bf53340d9e16d39d7aed22fd7ae1a300089164c94cf52632749
size 45697885504

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f7c4bdb662dcf5e8fd24cbe9947539fdd8f37ab5808930eccc170d24a0ade236
size 48340264256

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:930405e1be9395d55dc997c48c515d4cb6b6e6de28f94e054f602ccc6c575c1d
size 47415715136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:42649a583dacd3a8392b1ca0ade8d667b52ce8d32bc708e20b12609baa8b9a0a
size 43889222976

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:877a47130b740894c85cd74dff1dd1d1b53cb5c363075bd976da8b40556f05f8
size 39866043424

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:aebdb5318364de7f55d3054f629cbac53ac18b2393f3e57940c8cea3369acd31
size 14581422560

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:78c4a06e13f1bde33b5b1d6c458053c50f2f3fc9048d0ada9115031d90355809
size 39906604160

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4044d4d69a866d4b069efa9fdfd95cb44a9446675bbba06f44b93f1a9b69b107
size 11468534144

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:59c96da1443f54cdf1c209e9d2737f07caf437ea7b5a67fc56603e4f602ddc98
size 39809757568

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:521a263810873aac0f29136f3cc1cbab7140ce410b4ef03fa0726f826b07b1e0
size 24537871488

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4fbd6ed836c0be76c4cda2b556a832365246268e85b2486b2fd2a75a6e4ce125
size 39966804672

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:58960c9a025d61d3b2bc518b0e953e2c9c18b4e15128ed311f9291304a07c7f8
size 37295807328

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:00998783b06a2ac9ef7dc058cc26f7d134bb9c8f1a834837c8e2018a44b6e911
size 25209018

1
configuration.json Normal file
View File

@@ -0,0 +1 @@
{"framework": "pytorch", "task": "text-generation", "allow_remote": true}