2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00
2024-11-27 15:16:32 +08:00

license, library_name, tags, base_model, license_name, license_link, model-index
license library_name tags base_model license_name license_link model-index
other transformers
generated_from_trainer
Qwen/Qwen2.5-3B qwen-research https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
name results
outputs/gelato-3b

Prompt Format: ChatML

This is an experimental which was heavily optimized for reasoning tasks and not meant for production-use.

GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 15.40
IFEval (0-Shot) 31.54
BBH (3-Shot) 19.53
MATH Lvl 5 (4-Shot) 7.63
GPQA (0-shot) 3.69
MuSR (0-shot) 9.41
MMLU-PRO (5-shot) 20.60
Description
Model synced from source: arcee-ai/raspberry-3B
Readme 2 MiB