Initialize project; model provided by the ModelHub XC community

Model: Lewdiculous/Erosumika-7B-GGUF-IQ-Imatrix
Source: Original Platform
ModelHub XC
2026-05-15 19:00:53 +08:00
commit 03a52983d1
16 changed files with 164 additions and 0 deletions

49
.gitattributes vendored Normal file

@@ -0,0 +1,49 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-F16.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-Q6_K-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-Q8_0-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-Q5_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-Q5_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
imatrix.dat filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-IQ4_NL-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-IQ4_XS-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-Q4_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-Q4_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-IQ3_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-IQ3_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-IQ3_XS-imat.gguf filter=lfs diff=lfs merge=lfs -text
Erosumika-7B-IQ3_XXS-imat.gguf filter=lfs diff=lfs merge=lfs -text

3
Erosumika-7B-F16.gguf Normal file

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:68f87091424573bafaf09adc4ae8dfd83fa6c9f3f3c7c84de22ea7d17f69ab10
size 14484731968


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c71cf14d380f2dcd20e8a32cb0de16eb9fd2c54f842cc769569b9acc36f7c8e4
size 3284891744


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:706f41ae3909c8311b5529e5dd8a1eeea712367135e35d11b279af08b8701052
size 3182393440


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e8e7b26495b1948b4a5c6b9e101ce42f1a5f499158c69f275a77dc5e6c11503f
size 3000989792


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d0f542a833f7dd57d0b202af6ff5eebbd7517143adbc070070a485389e9db8f4
size 2827343968


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6faba7ce4c7e013d0690a731c29986f5676b9263328a592c084a6984a7787b5f
size 4125694048


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5975e41d358903447cdd14c11ea25f81025f685cce99f69643b37efc5b7d6e77
size 3907688544


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0c68494ab397724c883faff384fb8ac4a8382406b1f518bf3c6dfa7d07bafee7
size 4368439392


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2f648ca8503cb5169c2e7bd3f1d25440534ba50c337990d358b628a1d617143f
size 4140374112


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3f62b46265a2ccd966470d1db322c7251e48641f626d29dc682fbc01fa39d792
size 5131409504


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3fc88c90c1ce6cceddfa2185df51501a9ffb830343a3a5989bb2ab9552569733
size 4997716064


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:55c3efa3f489101cc25d6344ac54b5ae1c07add0bbf73ac4d5927986414769e7
size 5942065248


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d7c3b22c2882152186989cac59ed3acd5cc976ec7f2dc7dac72203d2c1cb47da
size 7695857760

73
README.md Normal file

@@ -0,0 +1,73 @@
---
library_name: transformers
tags:
- mistral
- quantized
- text-generation-inference
- roleplay
- gguf
pipeline_tag: text-generation
inference: false
license: cc-by-4.0
---
## GGUF-Imatrix quantizations for [localfultonextractor/Erosumika-7B](https://huggingface.co/localfultonextractor/Erosumika-7B/).
All credits belong to the author.
If you like these, also check out [FantasiaFoundry's GGUF-Quantization-Script](https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script).
## What does "Imatrix" mean?
It stands for **Importance Matrix**, a technique used to improve the quality of quantized models. <br>
[[1]](https://github.com/ggerganov/llama.cpp/discussions/5006/) <br>
The **Imatrix** is calculated from calibration data and helps determine the importance of different model activations during the quantization process. The idea is to preserve the most important information during quantization, which can help reduce the loss of model performance, especially when the calibration data is diverse. <br>
[[2]](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384/)
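
To make the idea concrete, here is a toy sketch of importance-weighted quantization. It is not llama.cpp's actual code: the function names, the 4-bit-style rounding, and the grid search over scales are all assumptions chosen for illustration. It only shows the principle that an importance vector changes which quantization scale counts as "best".

```python
# Toy illustration only; NOT the llama.cpp implementation.
# All names here (quantize_block, weighted_error, best_scale) are hypothetical.

def quantize_block(weights, scale, levels=7):
    """Round each weight to the nearest multiple of `scale`, clamped to [-levels, levels]."""
    return [max(-levels, min(levels, round(w / scale))) * scale for w in weights]


def weighted_error(weights, dequantized, importance):
    """Sum of squared errors, each multiplied by that weight's importance."""
    return sum(i * (w - q) ** 2 for w, q, i in zip(weights, dequantized, importance))


def best_scale(weights, importance, levels=7, steps=200):
    """Grid-search the scale that minimizes the importance-weighted error."""
    base = max(abs(w) for w in weights) / levels  # plain max-abs scale
    if base == 0:
        return 1.0  # all-zero block: any scale reproduces it exactly
    # Candidate scales from 0.5x to 1.5x of the plain scale (1.0x is included).
    candidates = [base * (0.5 + k / steps) for k in range(steps + 1)]
    return min(
        candidates,
        key=lambda s: weighted_error(weights, quantize_block(weights, s, levels), importance),
    )
```

Calling `best_scale` with a uniform importance vector gives the scale a plain quantizer would pick under this scheme; calling it with real importance values can select a different scale that spends less error on the weights that matter most.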
For the `--imatrix` data, the included `imatrix.dat` was used.
Using [llama.cpp-b2327](https://github.com/ggerganov/llama.cpp/releases/tag/b2327/):
```
Base⇢ GGUF(F16)⇢ Imatrix-Data(F16)⇢ GGUF(Imatrix-Quants)
```
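
The pipeline above can be sketched as a small Python helper that only builds the command lines, without running anything. The tool names (`convert.py`, `imatrix`, `quantize`) and their flags are written as I recall them from llama.cpp builds of that period and should be checked against each tool's `--help`. The calibration file name, the output naming scheme, and the `llama_cpp_dir` default are assumptions for illustration.

```python
from pathlib import Path


def build_pipeline(model_dir, calibration_txt, quant_types, llama_cpp_dir="llama.cpp"):
    """Return argv lists for: Base -> GGUF(F16) -> imatrix data -> imatrix quants."""
    root = Path(llama_cpp_dir)          # assumed location of a llama.cpp checkout/build
    name = Path(model_dir).name
    f16 = f"{name}-F16.gguf"
    commands = [
        # 1. Convert the base model to an F16 GGUF file.
        ["python", str(root / "convert.py"), str(model_dir),
         "--outtype", "f16", "--outfile", f16],
        # 2. Compute importance-matrix data from the F16 model and calibration text.
        [str(root / "imatrix"), "-m", f16, "-f", str(calibration_txt), "-o", "imatrix.dat"],
    ]
    # 3. Produce one quantized file per requested type, guided by imatrix.dat.
    for qtype in quant_types:
        out = f"{name}-{qtype}-imat.gguf"
        commands.append([str(root / "quantize"), "--imatrix", "imatrix.dat", f16, out, qtype])
    return commands
```

Each returned list could be passed to `subprocess.run(cmd, check=True)` in order; keeping the builder separate from execution makes it easy to print and review the commands first.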
The new **IQ3_S** quant option has been shown to be better than the old Q3_K_S, so I added it instead of the latter. It is only supported in `koboldcpp-1.59.1` or higher.
If you want any specific quantization to be added, feel free to ask.
<!-- ## Model image: -->
## Original model information:
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6512681f4151fb1fa719e033/AU4YsdxSuyVM0vVh27Cu-.png)
# Erosumika-7B
This is an attempt to create a model that combines multiple "established" 7Bs and a very small WIP private dataset with [Eros'](https://huggingface.co/tavtav/eros-7b-test) raw creative power. In terms of instruction formats, ChatML and Alpaca work best. The merge isn't purely ChatML, and as such, my previous attempts to integrate it with ChatML strings out of the box were Sisyphean and uninformed.
[GGUF](https://huggingface.co/localfultonextractor/Erosumika-7B-GGUF)
[exl2, 4bpw](https://huggingface.co/localfultonextractor/Erosumika-7B-4.0bpw-exl2)
[exl2, 6bpw](https://huggingface.co/localfultonextractor/Erosumika-7B-6.0bpw-exl2)
# Merge config.yml:
* I was asked to upload the merge configuration I used. Sadly, the one for the 'sumitest02' model is lost to time, like tears in rain:
```
slices:
  - sources:
      - model: localfultonextractor/sumitest02
        layer_range: [0, 32]
      - model: tavtav/eros-7b-test
        layer_range: [0, 32]
merge_method: slerp
base_model: localfultonextractor/sumitest02
parameters:
  t:
    - filter: self_attn
      value: [0, 0.2, 0.4, 0.55, 0.8]
    - filter: mlp
      value: [0.7, 0.3, 0.4, 0.3, 0]
    - value: 0.37 # fallback for rest of tensors
dtype: float16
```
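
For readers unfamiliar with `merge_method: slerp`, the following is a minimal sketch of spherical linear interpolation on plain Python lists. It is not the merge tool's own code. The `layer_t` helper reflects my assumption that a list-valued `t` is spread linearly across layer depth, and that `t = 0` corresponds to the base model; both should be verified against the merge tool's documentation.

```python
import math


def slerp(t, a, b, eps=1e-8):
    """Spherical linear interpolation between non-zero vectors a and b at fraction t."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    # Cosine of the angle between the two vectors, clamped for numerical safety.
    dot = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    if math.sin(theta) < eps:
        # Vectors are (anti)parallel: fall back to ordinary linear interpolation.
        return [(1 - t) * x + t * y for x, y in zip(a, b)]
    wa = math.sin((1 - t) * theta) / math.sin(theta)
    wb = math.sin(t * theta) / math.sin(theta)
    return [wa * x + wb * y for x, y in zip(a, b)]


def layer_t(anchors, layer, num_layers):
    """Hypothetical helper: linearly interpolate anchor values across layer depth."""
    if len(anchors) == 1 or num_layers == 1:
        return anchors[0]
    pos = layer / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]
```

Under these assumptions, `layer_t([0, 0.2, 0.4, 0.55, 0.8], layer, 32)` would give the `self_attn` interpolation factor for a given layer, which is then passed as `t` to `slerp` for that layer's tensors.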

3
imatrix.dat Normal file

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:625d54370e4d75aabcc171db3b5d481403ee76bd1929cc37beccc81258affb00
size 4988126