[Doc] Add Single NPU (Qwen2.5-VL-7B) tutorial (#311)

Run vllm-ascend on Single NPU What this PR does / why we need it? Add vllm-ascend tutorial doc for Qwen/Qwen2.5-VL-7B-Instruct model Inference/Serving doc Does this PR introduce any user-facing change? no How was this patch tested? no Signed-off-by: xiemingda <xiemingda1002@gmail.com>
2025-03-12 20:37:12 +08:00
parent 7330416de3
commit 59ea23d0d3
2 changed files with 192 additions and 0 deletions
--- a/docs/source/tutorials/index.md
+++ b/docs/source/tutorials/index.md
@@ -4,6 +4,7 @@
 :caption: Deployment
 :maxdepth: 1
 single_npu
+single_npu_multimodal
 multi_npu
 multi_node
 :::