Signed-off-by: leo-pony <nengjunma@outlook.com>
### What this PR does / why we need it?
Add multi-npu qwen3-MoE-32B Tutorials
Relate RFC: https://github.com/vllm-project/vllm-ascend/issues/1248
- vLLM version: v0.9.1
- vLLM main:
5358cce5ff
---------
Signed-off-by: leo-pony <nengjunma@outlook.com>
210 B
210 B
Tutorials
:::{toctree} :caption: Deployment :maxdepth: 1 single_npu single_npu_multimodal single_npu_audio multi_npu multi_npu_moge multi_npu_qwen3_moe multi_npu_quantization single_node_300i multi_node :::