初始化项目，由ModelHub XC社区提供模型

Model: DavidAU/LFM2.5-1.2B-MEGABRAIN2-Thinking-Kimi-V2-DISTILL Source: Original Platform
2026-06-01 05:24:24 +08:00
commit c2435ef0cc
9 changed files with 328215 additions and 0 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -0,0 +1,35 @@
 *.7z filter=lfs diff=lfs merge=lfs -text
 *.arrow filter=lfs diff=lfs merge=lfs -text
 *.bin filter=lfs diff=lfs merge=lfs -text
 *.bz2 filter=lfs diff=lfs merge=lfs -text
 *.ckpt filter=lfs diff=lfs merge=lfs -text
 *.ftz filter=lfs diff=lfs merge=lfs -text
 *.gz filter=lfs diff=lfs merge=lfs -text
 *.h5 filter=lfs diff=lfs merge=lfs -text
 *.joblib filter=lfs diff=lfs merge=lfs -text
 *.lfs.* filter=lfs diff=lfs merge=lfs -text
 *.mlmodel filter=lfs diff=lfs merge=lfs -text
 *.model filter=lfs diff=lfs merge=lfs -text
 *.msgpack filter=lfs diff=lfs merge=lfs -text
 *.npy filter=lfs diff=lfs merge=lfs -text
 *.npz filter=lfs diff=lfs merge=lfs -text
 *.onnx filter=lfs diff=lfs merge=lfs -text
 *.ot filter=lfs diff=lfs merge=lfs -text
 *.parquet filter=lfs diff=lfs merge=lfs -text
 *.pb filter=lfs diff=lfs merge=lfs -text
 *.pickle filter=lfs diff=lfs merge=lfs -text
 *.pkl filter=lfs diff=lfs merge=lfs -text
 *.pt filter=lfs diff=lfs merge=lfs -text
 *.pth filter=lfs diff=lfs merge=lfs -text
 *.rar filter=lfs diff=lfs merge=lfs -text
 *.safetensors filter=lfs diff=lfs merge=lfs -text
 saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.tar.* filter=lfs diff=lfs merge=lfs -text
 *.tar filter=lfs diff=lfs merge=lfs -text
 *.tflite filter=lfs diff=lfs merge=lfs -text
 *.tgz filter=lfs diff=lfs merge=lfs -text
 *.wasm filter=lfs diff=lfs merge=lfs -text
 *.xz filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
--- a/README.md
+++ b/README.md
@@ -0,0 +1,121 @@
 ---
 license: apache-2.0
 language:
 - en
 base_model:
 - DavidAU/LFM2.5-1.2B-MEGABRAIN-Thinking-Polaris-ClaudeHOPUS-Deepseek-GLM
 datasets:
 - TeichAI/kimi-k2-thinking-1000x
 pipeline_tag: text-generation
 library_name: transformers
 tags:
 - unsloth
 - finetune
 - All use cases
 - bfloat16
 - creative
 - creative writing
 - fiction writing
 - plot generation
 - sub-plot generation
 - fiction writing
 - story generation
 - scene continue
 - storytelling
 - fiction story
 - science fiction
 - romance
 - all genres
 - story
 - writing
 - vivid prosing
 - vivid writing
 - fiction
 ---
 <h2>LFM2.5-1.2B-MEGABRAIN2-Thinking-Kimi-V2-DISTILL</h2>
 This is a full deep thinking DavidAU/LFM2.5-1.2B-MEGABRAIN-Thinking-Polaris-ClaudeHOPUS-Deepseek-GLM (LFM2.5-1.2B Thinking base) 
 fine tune using Kimi V2 reasoning dataset via Unsloth via local hardware, Linux (for windows) 
 at 16 bit precision. The thinking / reasoning was completely replaced.
 Reasoning is compact, but detailed (very detailed) and right to the "point" so to speak.
 Reasoning affects:
 - General model operation.
 - Output generation
 - Benchmarks.
 Model Features:
 - 128k context
 - Temp range .1 to 2.5.
 - Reasoning is temp stable.
 IMPORTANT SETTINGS/QUANTS:
 - Strongly suggest q5,q6, q8 or 16 bit precision OR Imatrix IQ3_M min.
 - Rep pen 1.05 to 1.1 .
 Enjoy the freedom!
 <B>BENCHMARKS:</B>
 ```
 ARC-Challenge | ARC-Easy | BoolQ | Hellaswag | OpenBookQA | PIQA  | Winogrande
 0.359           0.464      0.748   0.505       0.372        0.702   0.535
 ```
 VS "Normal LFM2.5"
 ```
 ARC-Challenge | ARC-Easy | BoolQ | Hellaswag | OpenBookQA | PIQA  | Winogrande
 0.365           0.426      0.717   0.486       0.382        0.687   0.538
 ```
 ---
 <B>SPECIAL THANKS TO:</B>
 - Team "TeichAI" for the excellent dataset.
 - Team "Unsloth" for making the training painless.
 - Team "Nightmedia" for Benchmarks and co-labing.
 ---
 <B>Settings: CHAT / ROLEPLAY and/or SMOOTHER operation of this model:</B>
 In "KoboldCpp" or  "oobabooga/text-generation-webui" or "Silly Tavern" ;
 Set the "Smoothing_factor" to 1.5 
 : in KoboldCpp -> Settings->Samplers->Advanced-> "Smooth_F"
 : in text-generation-webui -> parameters -> lower right.
 : In Silly Tavern this is called: "Smoothing"
 NOTE: For "text-generation-webui" 
 -> if using GGUFs you need to use "llama_HF" (which involves downloading some config files from the SOURCE version of this model)
 Source versions (and config files) of my models are here:
 https://huggingface.co/collections/DavidAU/d-au-source-files-for-gguf-exl2-awq-gptq-hqq-etc-etc-66b55cb8ba25f914cbf210be
 OTHER OPTIONS:
 - Increase rep pen to 1.1 to 1.15 (you don't need to do this if you use "smoothing_factor")
 - If the interface/program you are using to run AI MODELS supports "Quadratic Sampling" ("smoothing") just make the adjustment as noted.
 <B>Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers</B>
 This a "Class 1" model:
 For all settings used for this model (including specifics for its "class"), including example generation(s) and for advanced settings guide (which many times addresses any model issue(s)), including methods to improve model performance for all use case(s) as well as chat, roleplay and other use case(s) please see:
 [ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]
 You can see all parameters used for generation, in addition to advanced parameters and samplers to get the most out of this model here:
 [ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]
--- a/chat_template.jinja
+++ b/chat_template.jinja
@@ -0,0 +1,45 @@
 {{- bos_token -}}
 {%- set keep_past_thinking = keep_past_thinking | default(false) -%}
 {%- set ns = namespace(system_prompt="") -%}
 {%- if messages[0]["role"] == "system" -%}
    {%- set ns.system_prompt = messages[0]["content"] -%}
    {%- set messages = messages[1:] -%}
 {%- endif -%}
 {%- if tools -%}
    {%- set ns.system_prompt = ns.system_prompt + ("\n" if ns.system_prompt else "") + "List of tools: [" -%}
    {%- for tool in tools -%}
        {%- if tool is not string -%}
            {%- set tool = tool | tojson -%}
        {%- endif -%}
        {%- set ns.system_prompt = ns.system_prompt + tool -%}
        {%- if not loop.last -%}
            {%- set ns.system_prompt = ns.system_prompt + ", " -%}
        {%- endif -%}
    {%- endfor -%}
    {%- set ns.system_prompt = ns.system_prompt + "]" -%}
 {%- endif -%}
 {%- if ns.system_prompt -%}
    {{- "<|im_start|>system\n" + ns.system_prompt + "<|im_end|>\n" -}}
 {%- endif -%}
 {%- set ns.last_assistant_index = -1 -%}
 {%- for message in messages -%}
    {%- if message["role"] == "assistant" -%}
        {%- set ns.last_assistant_index = loop.index0 -%}
    {%- endif -%}
 {%- endfor -%}
 {%- for message in messages -%}
    {{- "<|im_start|>" + message["role"] + "\n" -}}
    {%- set content = message["content"] -%}
    {%- if content is not string -%}
        {%- set content = content | tojson -%}
    {%- endif -%}
    {%- if message["role"] == "assistant" and not keep_past_thinking and loop.index0 != ns.last_assistant_index -%}
        {%- if "</think>" in content -%}
            {%- set content = content.split("</think>")[-1] | trim -%}
        {%- endif -%}
    {%- endif -%}
    {{- content + "<|im_end|>\n" -}}
 {%- endfor -%}
 {%- if add_generation_prompt -%}
    {{- "<|im_start|>assistant\n" -}}
 {%- endif -%}
--- a/config.json
+++ b/config.json
@@ -0,0 +1,57 @@
 {
  "architectures": [
    "Lfm2ForCausalLM"
  ],
  "block_auto_adjust_ff_dim": true,
  "block_dim": 2048,
  "block_ff_dim": 12288,
  "block_ffn_dim_multiplier": 1.0,
  "block_mlp_init_scale": 1.0,
  "block_multiple_of": 256,
  "block_norm_eps": 1e-05,
  "block_out_init_scale": 1.0,
  "block_use_swiglu": true,
  "block_use_xavier_init": true,
  "bos_token_id": 1,
  "conv_L_cache": 3,
  "conv_bias": false,
  "conv_dim": 2048,
  "conv_use_xavier_init": true,
  "dtype": "bfloat16",
  "eos_token_id": 7,
  "hidden_size": 2048,
  "initializer_range": 0.02,
  "intermediate_size": 12288,
  "layer_types": [
    "conv",
    "conv",
    "full_attention",
    "conv",
    "conv",
    "full_attention",
    "conv",
    "conv",
    "full_attention",
    "conv",
    "full_attention",
    "conv",
    "full_attention",
    "conv",
    "full_attention",
    "conv"
  ],
  "max_position_embeddings": 128000,
  "model_type": "lfm2",
  "norm_eps": 1e-05,
  "num_attention_heads": 32,
  "num_heads": 32,
  "num_hidden_layers": 16,
  "num_key_value_heads": 8,
  "pad_token_id": 0,
  "rope_theta": 1000000.0,
  "tie_embedding": true,
  "transformers_version": "4.57.6",
  "use_cache": true,
  "use_pos_enc": true,
  "vocab_size": 65536
 }
--- a/generation_config.json
+++ b/generation_config.json
@@ -0,0 +1,7 @@
 {
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 7,
  "pad_token_id": 0,
  "transformers_version": "4.57.6"
 }
--- a/model.safetensors
+++ b/model.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:f360f9f8b2bb9409b9465a91297a591966cc1f236a08fc5b77d7b54854c736de
 size 2340697936
--- a/special_tokens_map.json
+++ b/special_tokens_map.json
@@ -0,0 +1,23 @@
 {
  "bos_token": {
    "content": "<|startoftext|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "<|im_end|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": {
    "content": "<|pad|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  }
 }
--- a/tokenizer.json
+++ b/tokenizer.json
--- a/tokenizer_config.json
+++ b/tokenizer_config.json