Qwen1.5-MOE-sft-math7k-dens…/generation_config.json
ModelHub XC 8b6e8f053d Initialize project; model provided by the ModelHub XC community
Model: xd2010/Qwen1.5-MOE-sft-math7k-densemixer
Source: Original Platform
2026-05-15 21:29:47 +08:00

12 lines
207 B
JSON

{
  "attn_implementation": "flash_attention_2",
  "bos_token_id": 151643,
  "eos_token_id": [
    151645,
    151643
  ],
  "pad_token_id": 151643,
  "transformers_version": "4.51.0",
  "use_cache": false
}
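
As a minimal sketch of how these settings could be read, the snippet below parses the same JSON with Python's standard library and normalizes eos_token_id to a list. The helper name load_generation_settings and the CONFIG_TEXT constant are hypothetical, introduced only for illustration; they are not part of this repository or of any library.

```python
import json

# Contents copied from the generation_config.json shown above.
CONFIG_TEXT = """
{
  "attn_implementation": "flash_attention_2",
  "bos_token_id": 151643,
  "eos_token_id": [151645, 151643],
  "pad_token_id": 151643,
  "transformers_version": "4.51.0",
  "use_cache": false
}
"""


def load_generation_settings(text: str) -> dict:
    """Parse the config text and return it as a dict.

    Hypothetical helper: eos_token_id is normalized to a list so callers
    can treat a single id and several ids the same way.
    """
    cfg = json.loads(text)
    eos = cfg.get("eos_token_id", [])
    if isinstance(eos, int):
        eos = [eos]
    cfg["eos_token_id"] = list(eos)
    return cfg


settings = load_generation_settings(CONFIG_TEXT)

# Generation stops on any id listed in eos_token_id.
print("stop ids:", settings["eos_token_id"])
# In this file the padding id is the same as the BOS id.
print("pad == bos:", settings["pad_token_id"] == settings["bos_token_id"])
print("use_cache:", settings["use_cache"])
```

In this file the two stop ids are listed together, and the padding id reuses the BOS id, so any code that masks padding by token id alone would also mask BOS tokens.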