ModelHub XC 4a6de3ccee 初始化项目,由ModelHub XC社区提供模型
Model: YOYO-AI/Qwen3-8B-YOYO-nuslerp-128K
Source: Original Platform
2026-06-02 23:55:15 +08:00

license, language, base_model, pipeline_tag, tags
license language base_model pipeline_tag tags
apache-2.0
en
zh
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
Qwen/Qwen3-8B
text-generation
merge

Model Highlights:

  • merge method: nuslerp

  • Highest precision: dtype: float32 + out_dtype: bfloat16

  • Brand-new chat template: ensures normal operation on LM Studio

  • Context length: 131072

Model Selection Table:

Model Context Uses Basic Model
Qwen3-8B-YOYO-slerp 32K Yes
Qwen3-8B-YOYO-slerp-128K 128K Yes
Qwen3-8B-YOYO-nuslerp 32K No
Qwen3-8B-YOYO-nuslerp-128K 128K No
Qwen3-8B-YOYO-nuslerp-plus 32K Yes
Qwen3-8B-YOYO-nuslerp-plus-128K 128K Yes

Warning

: Models with 128K context may have slight quality loss. In most cases, please use the 32K native context!

Parameter Settings:

Thinking Mode:

Note

Temperature=0.6, TopP=0.95, TopK=20,MinP=0.

Configuration:

The following YAML configuration was used to produce this model:

models:
  - model: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
    parameters:
      weight: 1
  - model: Qwen/Qwen3-8B
    parameters:
      weight: 1
merge_method: nuslerp
tokenizer_source: Qwen/Qwen3-8B
parameters:
  normalize: true
  int8_mask: true
dtype: float32
out_dtype: bfloat16
Description
Model synced from source: YOYO-AI/Qwen3-8B-YOYO-nuslerp-128K
Readme 2.7 MiB
Languages
Text 100%