[Doc] Add Single NPU (Qwen2.5-VL-7B) tutorial (#311)

Run vllm-ascend on Single NPU

What this PR does / why we need it?
Add vllm-ascend tutorial doc for Qwen/Qwen2.5-VL-7B-Instruct model
Inference/Serving doc

Does this PR introduce any user-facing change?
no

How was this patch tested?
no

Signed-off-by: xiemingda <xiemingda1002@gmail.com>
This commit is contained in:
xiemingda
2025-03-12 20:37:12 +08:00
committed by GitHub
parent 7330416de3
commit 59ea23d0d3
2 changed files with 192 additions and 0 deletions

View File

@@ -4,6 +4,7 @@
:caption: Deployment
:maxdepth: 1
single_npu
single_npu_multimodal
multi_npu
multi_node
:::