Files
HermesNova-Llama-3.1-8B/README.md
ModelHub XC 516bebb612 初始化项目,由ModelHub XC社区提供模型
Model: DevQuasar/HermesNova-Llama-3.1-8B
Source: Original Platform
2026-05-13 07:30:35 +08:00

121 lines
2.9 KiB
Markdown

---
base_model:
- NousResearch/Hermes-3-Llama-3.1-8B
- arcee-ai/Llama-3.1-SuperNova-Lite
library_name: transformers
tags:
- mergekit
- merge
license: llama3.1
model-index:
- name: HermesNova-Llama-3.1-8B
results:
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: bbh
metrics:
- name: acc_norm
type: acc_norm
value: 0.5418
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: gpqa
metrics:
- name: acc_norm
type: acc_norm
value: 0.3365
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: math
metrics:
- name: exact_match
type: exact_match
value: 0.1148
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: mmlu
metrics:
- name: acc_norm
type: acc_norm
value: 0.3729
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: musr
metrics:
- name: acc_norm
type: acc_norm
value: 0.4330
verified: false
- task:
type: text-generation
dataset:
type: lm-evaluation-harness
name: hellaswag
metrics:
- name: acc
type: acc
value: 0.6306512646883091
verified: false
- name: acc_norm
type: acc_norm
value: 0.818263294164509
verified: false
---
[<img src="https://raw.githubusercontent.com/csabakecskemeti/devquasar/main/dq_logo_black-transparent.png" width="200"/>](https://devquasar.com)
'Make knowledge free for everyone'
<a href='https://ko-fi.com/L4L416YX7C' target='_blank'><img height='36' style='border:0px;height:36px;' src='https://storage.ko-fi.com/cdn/kofi6.png?v=6' border='0' alt='Buy Me a Coffee at ko-fi.com' /></a>
# HermesNova
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64e6d37e02dee9bcb9d9fa18/oxkvvhQOju_e5xl6REzNG.jpeg)
The 2 most powerful LLama3.1 model Hermes-3-Llama-3.1-8B and Llama-3.1-SuperNova-Lite merged
# merge
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.
### Models Merged
The following models were included in the merge:
* [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)
* [arcee-ai/Llama-3.1-SuperNova-Lite](https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: NousResearch/Hermes-3-Llama-3.1-8B
parameters:
weight: 1.0
- model: arcee-ai/Llama-3.1-SuperNova-Lite
parameters:
weight: 1.0
merge_method: linear
dtype: float16
```