[Doc] Add Single NPU (Qwen2.5-VL-7B) tutorial (#311)
Run vllm-ascend on Single NPU

What this PR does / why we need it?
Add a vllm-ascend tutorial doc covering inference and serving for the Qwen/Qwen2.5-VL-7B-Instruct model.

Does this PR introduce any user-facing change?
no

How was this patch tested?
no

Signed-off-by: xiemingda <xiemingda1002@gmail.com>
@@ -4,6 +4,7 @@
 :caption: Deployment
 :maxdepth: 1
 single_npu
+single_npu_multimodal
 multi_npu
 multi_node
 :::