Signed-off-by: leo-pony <nengjunma@outlook.com>
### What this PR does / why we need it?
Add multi-npu qwen3-MoE-32B Tutorials
Relate RFC: https://github.com/vllm-project/vllm-ascend/issues/1248
- vLLM version: v0.9.1
- vLLM main:
5358cce5ff
---------
Signed-off-by: leo-pony <nengjunma@outlook.com>
16 lines
210 B
Markdown
16 lines
210 B
Markdown
# Tutorials
|
|
|
|
:::{toctree}
|
|
:caption: Deployment
|
|
:maxdepth: 1
|
|
single_npu
|
|
single_npu_multimodal
|
|
single_npu_audio
|
|
multi_npu
|
|
multi_npu_moge
|
|
multi_npu_qwen3_moe
|
|
multi_npu_quantization
|
|
single_node_300i
|
|
multi_node
|
|
:::
|