
ModelHub XC bbbbfafbb7 Initialize project; model provided by the ModelHub XC community

Model: giraffe176/Open_Hermes_Orca_Mistral-7B
Source: Original Platform

2026-05-26 18:24:17 +08:00


license: apache-2.0
library_name: transformers
tags: mergekit, merge
base_model: (no value listed)
model-index: Open_Hermes_Orca_Mistral-7B

All results below are for the text-generation (Text Generation) task. The source for every entry is the Open LLM Leaderboard: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=giraffe176/Open_Hermes_Orca_Mistral-7B

Benchmark	Dataset	Config	Split	Few-shot	Metric	Value
AI2 Reasoning Challenge (25-Shot)	ai2_arc	ARC-Challenge	test	25	acc_norm (normalized accuracy)	64.68
HellaSwag (10-Shot)	hellaswag	n/a	validation	10	acc_norm (normalized accuracy)	84.63
MMLU (5-Shot)	cais/mmlu	all	test	5	acc (accuracy)	63.93
TruthfulQA (0-shot)	truthful_qa	multiple_choice	validation	0	mc2	53.34
Winogrande (5-shot)	winogrande	winogrande_xl	validation	5	acc (accuracy)	78.45
GSM8k (5-shot)	gsm8k	main	test	5	acc (accuracy)	56.18

.samplemodel

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged with the task arithmetic merge method, with teknium/OpenHermes-2.5-Mistral-7B as the base.
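As a rough illustration of what task arithmetic means, the sketch below applies the textbook formula, merged = base + sum of weight × (model − base), to tiny toy lists standing in for weight tensors. The function name `task_arithmetic` and the toy values are assumptions for illustration; this is not mergekit's implementation, and how mergekit treats a base model that also appears in the model list is not stated here.

```python
def task_arithmetic(base, models, weights):
    """Return base + sum_i w_i * (model_i - base), element-wise.

    All inputs are flat lists of floats standing in for weight tensors.
    """
    merged = list(base)
    for model, weight in zip(models, weights):
        for i, (m, b) in enumerate(zip(model, base)):
            # (m - b) is this model's "task vector" entry relative to the base.
            merged[i] += weight * (m - b)
    return merged


# Toy stand-ins (hypothetical values, not real model weights).
base = [1.0, 2.0, 3.0]    # plays the role of OpenHermes-2.5-Mistral-7B
other = [2.0, 2.0, 1.0]   # plays the role of Mistral-7B-OpenOrca

# Weights 1.0 and 0.6 mirror the configuration in this card.
# Under this formula the base's own task vector is all zeros,
# so its weight has no effect on the result.
merged = task_arithmetic(base, [base, other], [1.0, 0.6])
print([round(x, 4) for x in merged])
```

Under this formula the merged values move 60% of the way from the base toward the second model in each coordinate.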

Models Merged

The following models were included in the merge:

Open-Orca/Mistral-7B-OpenOrca

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: teknium/OpenHermes-2.5-Mistral-7B
    parameters:
      weight: 1.0
  - model: Open-Orca/Mistral-7B-OpenOrca
    parameters:
      weight: 0.6
merge_method: task_arithmetic
base_model: teknium/OpenHermes-2.5-Mistral-7B
dtype: float16
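
To reproduce a merge like this, one plausible route is to save the configuration to a file and pass it to mergekit's `mergekit-yaml` command. The sketch below only writes the file and assembles the command; it does not run it, since the merge downloads both source models. The file name `merge-config.yml` and the output directory `merged-model` are hypothetical choices, not taken from the original.

```python
import tempfile
from pathlib import Path

# The configuration exactly as listed in this card.
CONFIG = """\
models:
  - model: teknium/OpenHermes-2.5-Mistral-7B
    parameters:
      weight: 1.0
  - model: Open-Orca/Mistral-7B-OpenOrca
    parameters:
      weight: 0.6
merge_method: task_arithmetic
base_model: teknium/OpenHermes-2.5-Mistral-7B
dtype: float16
"""


def write_config(directory):
    """Write the merge configuration into `directory` and return its path."""
    path = Path(directory) / "merge-config.yml"  # hypothetical file name
    path.write_text(CONFIG, encoding="utf-8")
    return path


workdir = tempfile.mkdtemp()
config_path = write_config(workdir)

# Assumed invocation: mergekit-yaml <config file> <output directory>.
# The command is only printed here, not executed.
command = ["mergekit-yaml", str(config_path), str(Path(workdir) / "merged-model")]
print(" ".join(command))
```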

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	66.87
AI2 Reasoning Challenge (25-Shot)	64.68
HellaSwag (10-Shot)	84.63
MMLU (5-Shot)	63.93
TruthfulQA (0-shot)	53.34
Winogrande (5-shot)	78.45
GSM8k (5-shot)	56.18
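
The reported average can be checked against the six benchmark scores in the table above. This is a small sanity-check sketch that assumes the average is the unweighted mean of the six values.

```python
# Scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 64.68,
    "HellaSwag (10-Shot)": 84.63,
    "MMLU (5-Shot)": 63.93,
    "TruthfulQA (0-shot)": 53.34,
    "Winogrande (5-shot)": 78.45,
    "GSM8k (5-shot)": 56.18,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 66.87, matching the reported Avg.
```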
