1.7 KiB
1.7 KiB
This model was released on 2025-04-29 and added to Hugging Face Transformers on 2025-03-31.
Qwen3MoE
Overview
Qwen3MoE refers to the mixture of experts model architecture Qwen3-235B-A22B which was released with its dense variant Qwen3 (blog post).
Model Details
To be released with the official model launch.
Usage tips
To be released with the official model launch.
Qwen3MoeConfig
autodoc Qwen3MoeConfig
Qwen3MoeModel
autodoc Qwen3MoeModel - forward
Qwen3MoeForCausalLM
autodoc Qwen3MoeForCausalLM - forward
Qwen3MoeForSequenceClassification
autodoc Qwen3MoeForSequenceClassification - forward
Qwen3MoeForTokenClassification
autodoc Qwen3MoeForTokenClassification - forward
Qwen3MoeForQuestionAnswering
autodoc Qwen3MoeForQuestionAnswering - forward