library_name, tags, base_model, model-index
library_name
tags
base_model
model-index
transformers
bunnycore/Qwen-2.5-7B-Deep-Stock-v4
bunnycore/Qwen-2.5-7b-s1k-lora_model
name
results
Qwen-2.5-7b-S1k
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
IFEval (0-Shot)
HuggingFaceH4/ifeval
type
value
name
inst_level_strict_acc and prompt_level_strict_acc
71.62
strict accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
BBH (3-Shot)
BBH
type
value
name
acc_norm
36.69
normalized accuracy
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
MATH Lvl 5 (4-Shot)
hendrycks/competition_math
type
value
name
exact_match
47.81
exact match
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
GPQA (0-shot)
Idavidrein/gpqa
type
value
name
acc_norm
4.59
acc_norm
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
args
MuSR (0-shot)
TAUR-Lab/MuSR
type
value
name
acc_norm
9.26
acc_norm
task
dataset
metrics
source
type
name
text-generation
Text Generation
name
type
config
split
args
MMLU-PRO (5-shot)
TIGER-Lab/MMLU-Pro
main
test
type
value
name
acc
37.58
accuracy
System Prompt
Configuration
The following YAML configuration was used to produce this model:
Detailed results can be found here
Metric
Value
Avg.
34.59
IFEval (0-Shot)
71.62
BBH (3-Shot)
36.69
MATH Lvl 5 (4-Shot)
47.81
GPQA (0-shot)
4.59
MuSR (0-shot)
9.26
MMLU-PRO (5-shot)
37.58