add multi_npu_qwen3_dense tutorials (#4543)
### What this PR does / why we need it?
This PR adds tutorials for the Qwen3-Dense series models, including the
A2 and A3 series, and provides accuracy validation results.
- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c
---------
Signed-off-by: wind-all <anyuting@h-partners.com>
This commit is contained in:
@@ -14,6 +14,7 @@ multi_npu_qwen3_next
|
||||
multi_npu
|
||||
multi_npu_kimi-k2-thinking
|
||||
multi_npu_moge
|
||||
Qwen3-Dense
|
||||
multi_npu_qwen3_moe
|
||||
multi_npu_quantization
|
||||
single_node_300i
|
||||
|
||||
Reference in New Issue
Block a user