language, license, tags, datasets, license_name, license_link, pipeline_tag, model-index
language
license
tags
datasets
license_name
license_link
pipeline_tag
model-index
other
helpingai
LICENSE.md
text-generation
name
results
vortex-3b
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
config
split
args
AI2 Reasoning Challenge (25-Shot)
ai2_arc
ARC-Challenge
test
type
value
name
acc_norm
31.91
normalized accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
split
args
HellaSwag (10-Shot)
hellaswag
validation
type
value
name
acc_norm
56.89
normalized accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
config
split
args
MMLU (5-Shot)
cais/mmlu
all
test
type
value
name
acc
27.32
accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
config
split
args
TruthfulQA (0-shot)
truthful_qa
multiple_choice
validation
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
config
split
args
Winogrande (5-shot)
winogrande
winogrande_xl
validation
type
value
name
acc
60.14
accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
config
split
args
GSM8k (5-shot)
gsm8k
main
test
type
value
name
acc
0.91
accuracy
Model Overview
vortex-3b is a 2.78 billion parameter causal language model created by OEvortex that is derived from EleutherAI's Pythia-2.8b and fine-tuned on Vortex-50k dataset'
Detailed results can be found here
Metric
vortex 3b
vortex 3b-v2
dolly-v2-3b
pythia-2.8b-deduped
Avg.
35.76
37.46
25.26
36.72
AI2 Reasoning Challenge (25-Shot)
31.91
39.68
22.83
36.26
HellaSwag (10-Shot)
56.89
65.04
26.55
60.66
MMLU (5-Shot)
27.32
25.09
24.7
26.78
TruthfulQA (0-shot)
37.39
33.80
0
35.56
Winogrande (5-shot)
60.14
59.12
59.43
60.22
GSM8k (5-shot)
0.91
2.05
1.86
0.83