MoE-Qwen-4x1.8B-pretrain-50…/configuration.json

1 line, 73 B, JSON