初始化项目,由ModelHub XC社区提供模型
Model: Lewdiculous/Erosumika-7B-v2-GGUF-IQ-Imatrix Source: Original Platform
This commit is contained in:
47
.gitattributes
vendored
Normal file
47
.gitattributes
vendored
Normal file
@@ -0,0 +1,47 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-F16.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-IQ3_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-IQ3_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-IQ3_XXS-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-IQ4_XS-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-Q4_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-Q4_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-Q5_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-Q5_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-Q6_K-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Erosumika-7B-v2-Q8_0-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
imatrix.dat filter=lfs diff=lfs merge=lfs -text
|
||||
3
Erosumika-7B-v2-F16.gguf
Normal file
3
Erosumika-7B-v2-F16.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:364f31375bbb6fc153e436b1fbb5cb702ccd7af7500b9ab15743bf7effe420bf
|
||||
size 14484731616
|
||||
3
Erosumika-7B-v2-IQ3_M-imat.gguf
Normal file
3
Erosumika-7B-v2-IQ3_M-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ed7601a8da76c0e5bf00dd851f17e576fb05004f42dd4ab290c47d4d49d146b3
|
||||
size 3284891392
|
||||
3
Erosumika-7B-v2-IQ3_S-imat.gguf
Normal file
3
Erosumika-7B-v2-IQ3_S-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:169078dc83c7a470ab763966744281ec80aa879de4c159ed921899878d5bf2f3
|
||||
size 3182393088
|
||||
3
Erosumika-7B-v2-IQ3_XXS-imat.gguf
Normal file
3
Erosumika-7B-v2-IQ3_XXS-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:931baa1465af8d9a5ef4e6947078208bfe2a94d245e198582a824716f7ba453c
|
||||
size 2827343616
|
||||
3
Erosumika-7B-v2-IQ4_XS-imat.gguf
Normal file
3
Erosumika-7B-v2-IQ4_XS-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:f0e80eb8f2a96cb553c4857e20379b01aa889d79f35b6c9fd3e6acee7c145b11
|
||||
size 3907688192
|
||||
3
Erosumika-7B-v2-Q4_K_M-imat.gguf
Normal file
3
Erosumika-7B-v2-Q4_K_M-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ff4af1c659353ad1db49554730b43d20b59a92fb86f0b0162743f182811c2446
|
||||
size 4368439040
|
||||
3
Erosumika-7B-v2-Q4_K_S-imat.gguf
Normal file
3
Erosumika-7B-v2-Q4_K_S-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:50429ff0ab74a85707a0201e113d134a783dcba431d77be576a0b847fc39bcfa
|
||||
size 4140373760
|
||||
3
Erosumika-7B-v2-Q5_K_M-imat.gguf
Normal file
3
Erosumika-7B-v2-Q5_K_M-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:2ae9d23e45013a661356d39710f64f6d08f4385eb020bea8b7c04f1e7e00ba1d
|
||||
size 5131409152
|
||||
3
Erosumika-7B-v2-Q5_K_S-imat.gguf
Normal file
3
Erosumika-7B-v2-Q5_K_S-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:340543c7b07c5ef1205a99b79bcf25ab3bad0d839132bd285145aac2047065b8
|
||||
size 4997715712
|
||||
3
Erosumika-7B-v2-Q6_K-imat.gguf
Normal file
3
Erosumika-7B-v2-Q6_K-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:38204892e4f2d197c58fbd5733eff9dfe83d26145641cc28c58bd724a6961d42
|
||||
size 5942064896
|
||||
3
Erosumika-7B-v2-Q8_0-imat.gguf
Normal file
3
Erosumika-7B-v2-Q8_0-imat.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3b15fb890018d138d293c2ef15e6e2fa00c26a47c39bc04060cf51a6bfd81fd6
|
||||
size 7695857408
|
||||
85
README.md
Normal file
85
README.md
Normal file
@@ -0,0 +1,85 @@
|
||||
---
|
||||
language:
|
||||
- en
|
||||
pipeline_tag: text-generation
|
||||
tags:
|
||||
- text-generation-inference
|
||||
- instruct
|
||||
- conversational
|
||||
- roleplay
|
||||
- sillytavern
|
||||
- gguf
|
||||
- anime
|
||||
- quantized
|
||||
- mistral
|
||||
license: cc-by-4.0
|
||||
---
|
||||
|
||||
# **THIS VERSION IS NOW DEPRECATED. USE V3-0.2. V2 HAS PROBLEMS WITH ALIGNMENT AND THE NEW VERSION IS A SUBSTANTIAL IMPROVMENT!**
|
||||
|
||||
This repository hosts deprecated GGUF-IQ-Imatrix quants for [localfultonextractor/Erosumika-7B-v2](https://huggingface.co/localfultonextractor/Erosumika-7B-v2).
|
||||
|
||||
*"Better, smarter erosexika!!"*
|
||||
|
||||
[Quantized as per user request.](https://huggingface.co/Lewdiculous/Model-Requests/discussions/19)
|
||||
|
||||
Quants:
|
||||
```python
|
||||
quantization_options = [
|
||||
"Q4_K_M", "Q4_K_S", "IQ4_XS", "Q5_K_M", "Q5_K_S",
|
||||
"Q6_K", "Q8_0", "IQ3_M", "IQ3_S", "IQ3_XXS"
|
||||
]
|
||||
```
|
||||
|
||||
**What does "Imatrix" mean?**
|
||||
|
||||
It stands for **Importance Matrix**, a technique used to improve the quality of quantized models.
|
||||
The **Imatrix** is calculated based on calibration data, and it helps determine the importance of different model activations during the quantization process.
|
||||
The idea is to preserve the most important information during quantization, which can help reduce the loss of model performance, especially when the calibration data is diverse.
|
||||
[[1]](https://github.com/ggerganov/llama.cpp/discussions/5006) [[2]](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384)
|
||||
|
||||
For imatrix data generation, kalomaze's `groups_merged.txt` with added roleplay chats was used, you can find it [here](https://huggingface.co/Lewdiculous/Datura_7B-GGUF-Imatrix/blob/main/imatrix-with-rp-format-data.txt). This was just to add a bit more diversity to the data.
|
||||
|
||||
**Steps:**
|
||||
|
||||
```
|
||||
Base⇢ GGUF(F16)⇢ Imatrix-Data(F16)⇢ GGUF(Imatrix-Quants)
|
||||
```
|
||||
*Using the latest llama.cpp at the time.*
|
||||
|
||||
# Original model information:
|
||||
|
||||
<h1 style="text-align: center">Erosumika-7B-v2</h1>
|
||||
|
||||

|
||||
|
||||
## Model Details
|
||||
A DARE TIES merge between Nitral's [Kunocchini-7b](https://huggingface.co/Nitral-AI/Kunocchini-7b-128k-test), Epiculous' [Mika-7B](https://huggingface.co/Epiculous/Mika-7B) and my [FlatErosAlpha](https://huggingface.co/localfultonextractor/FlatErosAlpha), a flattened(in order to keep the vocab size 32000) version of tavtav's [eros-7B-ALPHA](https://huggingface.co/tavtav/eros-7B-ALPHA). In my brief testing, v2 is a significant improvement over the original Erosumika; I guess it won the DARE TIES lottery. Alpaca and Mistral seem to work best. Chat-ML might also work but I expect it to never end generations. Anything goes!
|
||||
|
||||
Due to it being an experimental model, there are some quirks...
|
||||
|
||||
- Rare occasion to misspell words
|
||||
- Very rare occasion to have random formatting artifact at the end of generations
|
||||
|
||||
[GGUF quants](https://huggingface.co/localfultonextractor/Erosumika-7B-v2-GGUF)
|
||||
|
||||
## Limitations and biases
|
||||
The intended use-case for this model is fictional writing for entertainment purposes. Any other sort of usage is out of scope.
|
||||
It may produce socially unacceptable or undesirable text, even if the prompt itself does not include anything explicitly offensive. Outputs might often be factually wrong or misleading.
|
||||
|
||||
|
||||
```yaml
|
||||
base_model: localfultonextractor/FlatErosAlpha
|
||||
models:
|
||||
- model: localfultonextractor/FlatErosAlpha
|
||||
- model: Epiculous/Mika-7B
|
||||
parameters:
|
||||
density: 0.5
|
||||
weight: 0.25
|
||||
- model: Nitral-AI/Kunocchini-7b
|
||||
parameters:
|
||||
density: 0.5
|
||||
weight: 0.75
|
||||
merge_method: dare_ties
|
||||
dtype: bfloat16
|
||||
```
|
||||
2346
imatrix-with-rp-data.txt
Normal file
2346
imatrix-with-rp-data.txt
Normal file
File diff suppressed because it is too large
Load Diff
3
imatrix.dat
Normal file
3
imatrix.dat
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:dd86ff91a511460dd671779b92f0e5bb818f21317fecc931cb7203f8aa289b1c
|
||||
size 4988126
|
||||
Reference in New Issue
Block a user