language, license, library_name, tags, base_model, datasets, model_name, pipeline_tag, inference, model_creator, quantized_by, model-index
language
license
library_name
tags
base_model
datasets
model_name
pipeline_tag
inference
model_creator
quantized_by
model-index
apache-2.0
transformers
chat
qwen
qwen2
finetune
chatml
OpenHermes-2.5
HelpSteer2
Orca
SlimOrca
Qwen/Qwen2-7B
nvidia/HelpSteer2
teknium/OpenHermes-2.5
microsoft/orca-math-word-problems-200k
Open-Orca/SlimOrca
calme-2.2-qwen2-7b
text-generation
false
MaziyarPanahi
MaziyarPanahi
name
results
calme-2.2-qwen2-7b
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
IFEval (0-Shot)
HuggingFaceH4/ifeval
type
value
name
inst_level_strict_acc and prompt_level_strict_acc
35.97
strict accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
BBH (3-Shot)
BBH
type
value
name
acc_norm
33.11
normalized accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
MATH Lvl 5 (4-Shot)
hendrycks/competition_math
type
value
name
exact_match
19.34
exact match
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
GPQA (0-shot)
Idavidrein/gpqa
type
value
name
acc_norm
5.48
acc_norm
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
MuSR (0-shot)
TAUR-Lab/MuSR
type
value
name
acc_norm
13.28
acc_norm
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
config
split
args
MMLU-PRO (5-shot)
TIGER-Lab/MMLU-Pro
main
test
type
value
name
acc
32.21
accuracy
MaziyarPanahi/calme-2.2-qwen2-7b
This is a fine-tuned version of the Qwen/Qwen2-7B model. It aims to improve the base model across all benchmarks.
⚡ Quantized GGUF
All GGUF models are available here: MaziyarPanahi/calme-2.2-qwen2-7b-GGUF
Detailed results can be found here
Metric
Value
Avg.
23.23
IFEval (0-Shot)
35.97
BBH (3-Shot)
33.11
MATH Lvl 5 (4-Shot)
19.34
GPQA (0-shot)
5.48
MuSR (0-shot)
13.28
MMLU-PRO (5-shot)
32.21
Prompt Template
This model uses ChatML prompt template:
How to use