library_name, tags, base_model, model-index
library_name tags base_model model-index
transformers
mergekit
merge
HuggingFaceTB/SmolLM2-360M
HuggingFaceTB/SmolLM2-360M-Instruct
name results
SmolLM2-360M-Merged
task dataset metrics source
type name
text-generation Text Generation
name type args
IFEval (0-Shot) HuggingFaceH4/ifeval
num_few_shot
0
type value name
inst_level_strict_acc and prompt_level_strict_acc 32.06 strict accuracy
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=vonjack/SmolLM2-360M-Merged Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type args
BBH (3-Shot) BBH
num_few_shot
3
type value name
acc_norm 4.74 normalized accuracy
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=vonjack/SmolLM2-360M-Merged Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type args
MATH Lvl 5 (4-Shot) hendrycks/competition_math
num_few_shot
4
type value name
exact_match 0.76 exact match
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=vonjack/SmolLM2-360M-Merged Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type args
GPQA (0-shot) Idavidrein/gpqa
num_few_shot
0
type value name
acc_norm 0.78 acc_norm
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=vonjack/SmolLM2-360M-Merged Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type args
MuSR (0-shot) TAUR-Lab/MuSR
num_few_shot
0
type value name
acc_norm 3.36 acc_norm
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=vonjack/SmolLM2-360M-Merged Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
MMLU-PRO (5-shot) TIGER-Lab/MMLU-Pro main test
num_few_shot
5
type value name
acc 1.09 accuracy
url name
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=vonjack/SmolLM2-360M-Merged Open LLM Leaderboard

SmolLM2-360M-Merged

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the TIES merge method using HuggingFaceTB/SmolLM2-360M as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: HuggingFaceTB/SmolLM2-360M-Instruct
    parameters:
      weight: 1
merge_method: ties
base_model: HuggingFaceTB/SmolLM2-360M
parameters:
  normalize: true
  int8_mask: true
dtype: bfloat16

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 7.13
IFEval (0-Shot) 32.06
BBH (3-Shot) 4.74
MATH Lvl 5 (4-Shot) 0.76
GPQA (0-shot) 0.78
MuSR (0-shot) 3.36
MMLU-PRO (5-shot) 1.09
Description
Model synced from source: vonjack/SmolLM2-360M-Merged
Readme 1.3 MiB
Languages
Text 100%