*This model was released on 2025-04-29 and added to Hugging Face Transformers on 2025-03-31.* # Qwen3MoE ## Overview [Qwen3MoE](https://huggingface.co/papers/2505.09388) refers to the mixture of experts model architecture Qwen3-235B-A22B which was released with its dense variant [Qwen3](qwen3) ([blog post](https://qwenlm.github.io/blog/qwen3/)). ### Model Details To be released with the official model launch. ## Usage tips To be released with the official model launch. ## Qwen3MoeConfig [[autodoc]] Qwen3MoeConfig ## Qwen3MoeModel [[autodoc]] Qwen3MoeModel - forward ## Qwen3MoeForCausalLM [[autodoc]] Qwen3MoeForCausalLM - forward ## Qwen3MoeForSequenceClassification [[autodoc]] Qwen3MoeForSequenceClassification - forward ## Qwen3MoeForTokenClassification [[autodoc]] Qwen3MoeForTokenClassification - forward ## Qwen3MoeForQuestionAnswering [[autodoc]] Qwen3MoeForQuestionAnswering - forward