HermesNova-Llama-3.1-8B/README.md

---
base_model:
- NousResearch/Hermes-3-Llama-3.1-8B
- arcee-ai/Llama-3.1-SuperNova-Lite
library_name: transformers
tags:
- mergekit
- merge
license: llama3.1
model-index:
- name: HermesNova-Llama-3.1-8B
  results:
  - task:
      type: text-generation
    dataset:
      type: lm-evaluation-harness
      name: bbh
    metrics:
    - name: acc_norm
      type: acc_norm
      value: 0.5418
      verified: false
  - task:
      type: text-generation
    dataset:
      type: lm-evaluation-harness
      name: gpqa
    metrics:
    - name: acc_norm
      type: acc_norm
      value: 0.3365
      verified: false
  - task:
      type: text-generation
    dataset:
      type: lm-evaluation-harness
      name: math
    metrics:
    - name: exact_match
      type: exact_match
      value: 0.1148
      verified: false
  - task:
      type: text-generation
    dataset:
      type: lm-evaluation-harness
      name: mmlu
    metrics:
    - name: acc_norm
      type: acc_norm
      value: 0.3729
      verified: false
  - task:
      type: text-generation
    dataset:
      type: lm-evaluation-harness
      name: musr
    metrics:
    - name: acc_norm
      type: acc_norm
      value: 0.4330
      verified: false
  - task:
      type: text-generation
    dataset:
      type: lm-evaluation-harness
      name: hellaswag
    metrics:
    - name: acc
      type: acc
      value: 0.6306512646883091
      verified: false
    - name: acc_norm
      type: acc_norm
      value: 0.818263294164509
      verified: false
---
[<img src="https://raw.githubusercontent.com/csabakecskemeti/devquasar/main/dq_logo_black-transparent.png" width="200"/>](https://devquasar.com)

'Make knowledge free for everyone'

<a href='https://ko-fi.com/L4L416YX7C' target='_blank'><img height='36' style='border:0px;height:36px;' src='https://storage.ko-fi.com/cdn/kofi6.png?v=6' border='0' alt='Buy Me a Coffee at ko-fi.com' /></a>

# HermesNova

![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64e6d37e02dee9bcb9d9fa18/oxkvvhQOju_e5xl6REzNG.jpeg)

The 2 most powerful LLama3.1 model Hermes-3-Llama-3.1-8B and Llama-3.1-SuperNova-Lite merged


# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.

### Models Merged

The following models were included in the merge:
* [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)
* [arcee-ai/Llama-3.1-SuperNova-Lite](https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: NousResearch/Hermes-3-Llama-3.1-8B
    parameters:
      weight: 1.0
  - model: arcee-ai/Llama-3.1-SuperNova-Lite
    parameters:
      weight: 1.0
merge_method: linear
dtype: float16

```