初始化项目，由ModelHub XC社区提供模型

Model: DevQuasar/HermesNova-Llama-3.1-8B Source: Original Platform
2026-05-13 07:30:35 +08:00
commit 516bebb612
27 changed files with 4915 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,121 @@
+---
+base_model:
+- NousResearch/Hermes-3-Llama-3.1-8B
+- arcee-ai/Llama-3.1-SuperNova-Lite
+library_name: transformers
+tags:
+- mergekit
+- merge
+license: llama3.1
+model-index:
+- name: HermesNova-Llama-3.1-8B
+  results:
+  - task:
+      type: text-generation
+    dataset:
+      type: lm-evaluation-harness
+      name: bbh
+    metrics:
+    - name: acc_norm
+      type: acc_norm
+      value: 0.5418
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: lm-evaluation-harness
+      name: gpqa
+    metrics:
+    - name: acc_norm
+      type: acc_norm
+      value: 0.3365
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: lm-evaluation-harness
+      name: math
+    metrics:
+    - name: exact_match
+      type: exact_match
+      value: 0.1148
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: lm-evaluation-harness
+      name: mmlu
+    metrics:
+    - name: acc_norm
+      type: acc_norm
+      value: 0.3729
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: lm-evaluation-harness
+      name: musr
+    metrics:
+    - name: acc_norm
+      type: acc_norm
+      value: 0.4330
+      verified: false
+  - task:
+      type: text-generation
+    dataset:
+      type: lm-evaluation-harness
+      name: hellaswag
+    metrics:
+    - name: acc
+      type: acc
+      value: 0.6306512646883091
+      verified: false
+    - name: acc_norm
+      type: acc_norm
+      value: 0.818263294164509
+      verified: false
+---
+[<img src="https://raw.githubusercontent.com/csabakecskemeti/devquasar/main/dq_logo_black-transparent.png" width="200"/>](https://devquasar.com)
+
+'Make knowledge free for everyone'
+
+<a href='https://ko-fi.com/L4L416YX7C' target='_blank'><img height='36' style='border:0px;height:36px;' src='https://storage.ko-fi.com/cdn/kofi6.png?v=6' border='0' alt='Buy Me a Coffee at ko-fi.com' /></a>
+
+# HermesNova
+
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64e6d37e02dee9bcb9d9fa18/oxkvvhQOju_e5xl6REzNG.jpeg)
+
+The 2 most powerful LLama3.1 model Hermes-3-Llama-3.1-8B and Llama-3.1-SuperNova-Lite merged
+
+
+# merge
+
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+## Merge Details
+### Merge Method
+
+This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.
+
+### Models Merged
+
+The following models were included in the merge:
+* [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)
+* [arcee-ai/Llama-3.1-SuperNova-Lite](https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite)
+
+### Configuration
+
+The following YAML configuration was used to produce this model:
+
+```yaml
+models:
+  - model: NousResearch/Hermes-3-Llama-3.1-8B
+    parameters:
+      weight: 1.0
+  - model: arcee-ai/Llama-3.1-SuperNova-Lite
+    parameters:
+      weight: 1.0
+merge_method: linear
+dtype: float16
+
+```