[Tutorial] Add qwen3 8b w4a8 tutorial (#2249)

### What this PR does / why we need it?

Add a new single npu quantization tutorial, and using the latest qwen3
model.

- vLLM version: v0.10.0
- vLLM main:
8e8e0b6af1

Signed-off-by: 22dimensions <waitingwind@foxmail.com>
This commit is contained in:
22dimensions
2025-08-07 14:39:38 +08:00
committed by GitHub
parent bcd0b532f5
commit 440d28a138
2 changed files with 132 additions and 0 deletions

View File

@@ -7,6 +7,7 @@ single_npu
single_npu_multimodal
single_npu_audio
single_npu_qwen3_embedding
single_npu_qwen3_quantization
multi_npu
multi_npu_moge
multi_npu_qwen3_moe