2024-12-27 02:08:38 +00:00
---
2024-12-27 10:15:38 +08:00
base_model:
- inflatebot/helide-beta-r0
- inflatebot/helide-beta-r4
- Sao10K/L3-8B-Stheno-v3.2
- inflatebot/helide-beta-r1
library_name: transformers
tags:
- mergekit
- merge
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
---
# helium-3-r5
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
This is a merge of pre-trained language models created using [mergekit ](https://github.com/cg123/mergekit ).
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
## Merge Details
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
Helium3, but the base is Stheno. A bit stupid. If Helium3-baseLlama is too dry and regular Helium3 is too horny, this *might* be a good middle ground.
Emphasis on "might."
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
[GGUFs by mradermacher ](https://huggingface.co/mradermacher/L3-8B-Helium3-baseStheno-GGUF )
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
### Merge Method
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
This model was merged using the [Model Stock ](https://arxiv.org/abs/2403.19522 ) merge method using [Sao10K/L3-8B-Stheno-v3.2 ](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2 ) as a base.
### Models Merged
2024-12-27 02:08:38 +00:00
2024-12-27 10:15:38 +08:00
The following models were included in the merge:
* [inflatebot/helide-beta-r0 ](https://huggingface.co/inflatebot/helide-beta-r0 )
* [inflatebot/helide-beta-r4 ](https://huggingface.co/inflatebot/helide-beta-r4 )
* [inflatebot/helide-beta-r1 ](https://huggingface.co/inflatebot/helide-beta-r1 )
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: inflatebot/helide-beta-r4
- model: inflatebot/helide-beta-r1
- model: inflatebot/helide-beta-r0
merge_method: model_stock
base_model: Sao10K/L3-8B-Stheno-v3.2
dtype: bfloat16
```