Files
Buttocks-7B-v1.1/README.md
ModelHub XC d3d62fb59c 初始化项目,由ModelHub XC社区提供模型
Model: TeeZee/Buttocks-7B-v1.1
Source: Original Platform
2026-05-05 19:51:37 +08:00

4.5 KiB

license, tags, model-index
license tags model-index
cc-by-nc-4.0
not-for-all-audiences
merge
name results
Buttocks-7B-v1.1
task dataset metrics source
type name
text-generation Text Generation
name type config split args
AI2 Reasoning Challenge (25-Shot) ai2_arc ARC-Challenge test
num_few_shot
25
type value name
acc_norm 54.61 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/Buttocks-7B-v1.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
HellaSwag (10-Shot) hellaswag validation
num_few_shot
10
type value name
acc_norm 75.61 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/Buttocks-7B-v1.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
MMLU (5-Shot) cais/mmlu all test
num_few_shot
5
type value name
acc 50.22 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/Buttocks-7B-v1.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
TruthfulQA (0-shot) truthful_qa multiple_choice validation
num_few_shot
0
type value
mc2 44.72
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/Buttocks-7B-v1.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
Winogrande (5-shot) winogrande winogrande_xl validation
num_few_shot
5
type value name
acc 68.9 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/Buttocks-7B-v1.1 Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
GSM8k (5-shot) gsm8k main test
num_few_shot
5
type value name
acc 5.76 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/Buttocks-7B-v1.1 Open LLM Leaderboard

Buttocks 7B v1.1

An experiment that has gone very, very wrong.

Model details

  • Recreation of the original recipe for Undi95/Toppy-M-7B, but instead of final merge done by mergekit, MergeMoster was used with extended RPG preset.
  • recipe in mergekit-config, stepsAA, BB, CC are the original models with LORAS as per Toppy M 7B sauce.
  • LERP merge method was used

Results

  • in simple terms this model is totally unhinged
  • it always produces sequences similar to fever dreams or drug trips
  • on a good day it can produce scenarios similar to old Monty Python sketches
  • models shows incredible affinity to words like 'ass', 'buttocks', 'farts', prompting with those single words will probably produce a whole story revolving around those topics.

Possible uses

  • to generate dream sequence in a story
  • to make the boring model more unpredictable by merging at low weights with this monster
  • to take a break, connect Silly Tavern to this model and get a few ROTFLs observing how every story deteriorates into pure craziness
  • research on LLM hallucinations

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 49.97
AI2 Reasoning Challenge (25-Shot) 54.61
HellaSwag (10-Shot) 75.61
MMLU (5-Shot) 50.22
TruthfulQA (0-shot) 44.72
Winogrande (5-shot) 68.90
GSM8k (5-shot) 5.76