Files
xc-llm-ascend/docs/source/tutorials/index.md
wind-all 1a443f2772 add multi_npu_qwen3_dense tutorials (#4543)
### What this PR does / why we need it?

This PR adds tutorials for the Qwen3-Dense series models, including the
A2 and A3 series, and provides accuracy validation results.



- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c

---------

Signed-off-by: wind-all <anyuting@h-partners.com>
2025-12-10 16:09:56 +08:00

575 B

Tutorials

:::{toctree} :caption: Deployment :maxdepth: 1 single_npu single_npu_qwen2.5_vl single_npu_qwen2_audio single_npu_qwen3_embedding single_npu_qwen3_quantization single_npu_qwen3_w4a4 single_node_pd_disaggregation_mooncake multi_npu_qwen3_next multi_npu multi_npu_kimi-k2-thinking multi_npu_moge Qwen3-Dense multi_npu_qwen3_moe multi_npu_quantization single_node_300i DeepSeek-V3.1.md DeepSeek-V3.2-Exp.md Qwen3-235B-A22B.md Qwen3-Coder-30B-A3B multi_node multi_node_kimi multi_node_qwen3vl multi_node_pd_disaggregation_mooncake multi_node_ray Qwen2.5-Omni.md :::