Initialize project; model provided by the ModelHub XC community

Model: mradermacher/Firefly-V2-i1-GGUF
Source: Original Platform
ModelHub XC
2026-04-30 09:15:59 +08:00
commit cc7adeedd6
27 changed files with 225 additions and 0 deletions

60
.gitattributes vendored Normal file
@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
Firefly-V2.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Firefly-V2.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
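Note that the attributes above contain no `*.gguf` wildcard: each GGUF file is routed through LFS by its exact name. The following is a minimal sketch of how such patterns decide whether a file is LFS-tracked. It assumes that matching slash-free patterns against the file's basename with Python's `fnmatchcase` is a close enough approximation of gitattributes matching; `LFS_PATTERNS` holds only a few of the entries above, and `is_lfs_tracked` is a hypothetical helper, not part of Git.

```python
from fnmatch import fnmatchcase

# A handful of patterns copied from the .gitattributes above (not the full list).
LFS_PATTERNS = [
    "*.bin",
    "*.safetensors",
    "*.tar.*",
    "*tfevents*",
    "Firefly-V2.imatrix.gguf",
    "Firefly-V2.i1-Q4_K_M.gguf",
]


def is_lfs_tracked(filename: str, patterns=LFS_PATTERNS) -> bool:
    """Approximate gitattributes matching for slash-free patterns.

    Assumption: a pattern without a slash is compared against the
    basename of the path, which fnmatchcase handles for simple globs.
    """
    basename = filename.rsplit("/", 1)[-1]
    return any(fnmatchcase(basename, pattern) for pattern in patterns)


if __name__ == "__main__":
    for name in ("Firefly-V2.i1-Q4_K_M.gguf", "Firefly-V2.i1-Q8_0.gguf", "README.md"):
        print(name, is_lfs_tracked(name))
```

Under this sketch, a GGUF file that is not listed by name (such as the hypothetical `Firefly-V2.i1-Q8_0.gguf`) would be committed as a regular blob rather than as an LFS pointer.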

3
Firefly-V2.i1-IQ1_M.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6ae1cc876d0fe0e7df4e6d4ed5ffb398f49112cc5dcea7ae9fcd20b83d03e821
size 924188800

3
Firefly-V2.i1-IQ1_S.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fa61a76d741281a8543f4abfcc6c7d19633faf53f5828b4e54553387582fb792
size 868155520

3
Firefly-V2.i1-IQ2_M.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:28cf7bb869e5dfe3c12fa1a457ed4e48943d8d0d5f19e145ca9a62e36439f473
size 1229029504

3
Firefly-V2.i1-IQ2_S.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:622f5f133245cc57aec65ddcecd68c160d815831e9fcec30a67b5e40a6e755b4
size 1154318464

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:759a0bd5db88313c61cebc1b9edb4466cf63bc335e4f3b2ec8cac4e1ab6d7301
size 1100546176

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:17bf5cbb6511ee178954abc99fec91f0285fb166f11060836b9d2f3bf0a3ec8a
size 1017577600

3
Firefly-V2.i1-IQ3_M.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e5de546fa9247d5684b0b894ab85b90a4eca1bf5f35a3cbf8b6ba4f3eaa930a5
size 1599666304

3
Firefly-V2.i1-IQ3_S.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:17d11fdda891bc0b0420950e4e7bf3894f4ed691236f249ee4dbb73fb82f2941
size 1542846592

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b3c439aa5f15ce5e7de066ec0999a6f28c16ecfc2a67b9550ec996acc8ac3f5e
size 1476786304

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:847af97326efdc5fed53c5a52800faba0b86f8436e6c6176ca984b901600b964
size 1348763776

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:58f95249b731cc334b1b9ba89f2fd5ec78acceacec27149eab563668ac4e557a
size 1917188224

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ba925620ab7b0a396b88108aa70e4a27faa6132c24ff03e997667b3ee2852bf6
size 1829107840

3
Firefly-V2.i1-Q2_K.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:849714eade913bb9b88397314748ef6449e40b14ec4cd18fbbb5a2d110bea92f
size 1363933312

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d35463bb83ddf342a9578b8d313b74a468f392405d1fa877dedc049f5302f1d9
size 1274280064

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cca766594533cb66182719f013508210419d42ab3196b5ca2c6607625ae6901b
size 1815345280

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a97079b70bea872e56dcb547e11121a121a416d0d85f31ebc2d14f7b23ed495f
size 1687156864

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2aa45949e0041ce0a68aad05ad38c70718fb6afe68158d53217b55ca084031f2
size 1542846592

3
Firefly-V2.i1-Q4_0.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a5414d085ea2a2a76a2c5e4d76cd4c3476e41c784066b790067fb78e49abdfa7
size 1921906816

3
Firefly-V2.i1-Q4_1.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f0ec43f0550dd795401b07d010dff8e6cd7a351ef0ca76f5aaee2a239b2a8210
size 2093348992

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cafe2cfa78d462338b89953b10841852c3c112a9899c91b47d85642f5c0e1fb7
size 2019375232

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c6bd75034a4c58ba5c615d77dbd869cb14914442bef4fda26f95f416bd907d8d
size 1928198272

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c7a03300bc28430b0f5774f10a0080f2999c96c227ceae921191265c7de8d5b3
size 2322151552

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7977002ce04654936b4269d45435afe59c123846eb40d17f2fc01cd1eaf90036
size 2269509760

3
Firefly-V2.i1-Q6_K.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e0f73d40707e9fc18c23eedfe0c691d310f0e28f09a131730a557caaac5dad0c
size 2643851392

3
Firefly-V2.imatrix.gguf Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2fd5ebe498efccf427e78738d62a7edad146c85fd12266e13e575082a1c33008
size 3012064
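
Each `.gguf` entry in this commit is a three-line Git LFS pointer (spec version, SHA-256 object ID, size in bytes), not the model data itself. The sketch below shows one way to parse such a pointer and check a downloaded file against it. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names introduced for illustration; only the pointer text is taken from this commit.

```python
import hashlib
from pathlib import Path

# Pointer text copied from the Firefly-V2.imatrix.gguf entry above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:2fd5ebe498efccf427e78738d62a7edad146c85fd12266e13e575082a1c33008
size 3012064
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first; avoids hashing a truncated download
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER_TEXT)
    print(pointer["algo"], pointer["size"])
```

Comparing the size before hashing is a design choice: multi-gigabyte quants take a while to hash, and an incomplete download is usually caught by the size alone.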

90
README.md Normal file
@@ -0,0 +1,90 @@
---
base_model: Guilherme34/Firefly-V2
language:
- en
library_name: transformers
mradermacher:
  readme_rev: 1
quantized_by: mradermacher
tags:
- merge
- mergekit
- lazymergekit
- Guilherme34/Firefly
- SicariusSicariiStuff/Impish_LLAMA_3B
---
## About
<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
<!-- ### quants_skip: -->
<!-- ### skip_mmproj: -->
weighted/imatrix quants of https://huggingface.co/Guilherme34/Firefly-V2
<!-- provided-files -->
***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Firefly-V2-i1-GGUF).***
static quants are available at https://huggingface.co/mradermacher/Firefly-V2-GGUF
## Usage
If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including on how to concatenate multi-part files.
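
As a rough illustration of fetching a single quant, the sketch below builds the download URL from the quant type, following the `resolve/main` link pattern used in the table under "Provided Quants". The helper names (`quant_filename`, `quant_url`, `download_quant`) are hypothetical, and the download uses only the Python standard library; it assumes the file is publicly reachable without authentication.

```python
from urllib.request import urlretrieve

REPO = "mradermacher/Firefly-V2-i1-GGUF"
BASE_NAME = "Firefly-V2"


def quant_filename(quant: str) -> str:
    """File name for a quant type such as 'Q4_K_M' or 'IQ3_XS'."""
    return f"{BASE_NAME}.i1-{quant}.gguf"


def quant_url(quant: str) -> str:
    """Direct download URL, mirroring the links in the Provided Quants table."""
    return f"https://huggingface.co/{REPO}/resolve/main/{quant_filename(quant)}"


def download_quant(quant: str, dest_dir: str = ".") -> str:
    """Download one quant into dest_dir and return the local path.

    Assumption: anonymous HTTP access works; files are 1 to 3 GB, so this
    can take a while and performs no resume or integrity checking.
    """
    target = f"{dest_dir}/{quant_filename(quant)}"
    urlretrieve(quant_url(quant), target)
    return target


if __name__ == "__main__":
    print(quant_url("Q4_K_M"))
```

The resulting `.gguf` file can then be passed to any GGUF-capable runtime; which one to use is outside the scope of this README.
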
## Provided Quants
(sorted by size, not necessarily by quality; IQ-quants are often preferable to similar-sized non-IQ quants)
| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ1_S.gguf) | i1-IQ1_S | 1.0 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ1_M.gguf) | i1-IQ1_M | 1.0 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 1.1 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ2_XS.gguf) | i1-IQ2_XS | 1.2 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ2_S.gguf) | i1-IQ2_S | 1.3 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ2_M.gguf) | i1-IQ2_M | 1.3 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q2_K_S.gguf) | i1-Q2_K_S | 1.4 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 1.4 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q2_K.gguf) | i1-Q2_K | 1.5 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ3_XS.gguf) | i1-IQ3_XS | 1.6 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ3_S.gguf) | i1-IQ3_S | 1.6 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q3_K_S.gguf) | i1-Q3_K_S | 1.6 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ3_M.gguf) | i1-IQ3_M | 1.7 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q3_K_M.gguf) | i1-Q3_K_M | 1.8 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q3_K_L.gguf) | i1-Q3_K_L | 1.9 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ4_XS.gguf) | i1-IQ4_XS | 1.9 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-IQ4_NL.gguf) | i1-IQ4_NL | 2.0 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q4_0.gguf) | i1-Q4_0 | 2.0 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q4_K_S.gguf) | i1-Q4_K_S | 2.0 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q4_K_M.gguf) | i1-Q4_K_M | 2.1 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q4_1.gguf) | i1-Q4_1 | 2.2 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q5_K_S.gguf) | i1-Q5_K_S | 2.4 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q5_K_M.gguf) | i1-Q5_K_M | 2.4 | |
| [GGUF](https://huggingface.co/mradermacher/Firefly-V2-i1-GGUF/resolve/main/Firefly-V2.i1-Q6_K.gguf) | i1-Q6_K | 2.7 | practically like static Q6_K |
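
One simple way to use the table is to pick the largest quant whose file fits a given size budget. The sketch below does this for a subset of the rows above. It is only a heuristic: as noted, size order is not necessarily quality order, and the budget here covers the file size alone, not the extra memory a runtime needs for context. `QUANTS` and `largest_fitting` are names introduced for illustration.

```python
# (type, size in GB) pairs copied from a subset of the table above.
QUANTS = [
    ("i1-IQ1_S", 1.0),
    ("i1-IQ2_M", 1.3),
    ("i1-IQ3_M", 1.7),
    ("i1-IQ4_XS", 1.9),
    ("i1-Q4_K_S", 2.0),
    ("i1-Q4_K_M", 2.1),
    ("i1-Q5_K_M", 2.4),
    ("i1-Q6_K", 2.7),
]


def largest_fitting(budget_gb: float, quants=QUANTS):
    """Return the type of the largest quant not exceeding budget_gb, or None."""
    fitting = [entry for entry in quants if entry[1] <= budget_gb]
    if not fitting:
        return None
    return max(fitting, key=lambda entry: entry[1])[0]


if __name__ == "__main__":
    for budget in (0.5, 2.0, 8.0):
        print(budget, largest_fitting(budget))
```
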
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):
![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
## FAQ / Model Request
See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.
## Thanks
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
<!-- end -->