### What this PR does / why we need it?
Add a deployment tutorial for Qwen3 Next (`multi_npu_qwen3_next`) and list it in the Tutorials index.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
Doc CI passed
Related: https://github.com/vllm-project/vllm-ascend/issues/2884
- vLLM version: v0.10.2
- vLLM main: 01413e0cf5
Signed-off-by: Yikun Jiang <yikunkero@gmail.com>
# Tutorials
:::{toctree}
:caption: Deployment
:maxdepth: 1
single_npu
single_npu_multimodal
single_npu_audio
single_npu_qwen3_embedding
single_npu_qwen3_quantization
multi_npu_qwen3_next
multi_npu
multi_npu_moge
multi_npu_qwen3_moe
multi_npu_quantization
single_node_300i
multi_node
multi_node_kimi
multi_node_pd_disaggregation
:::