Files
xc-llm-ascend/docs/source/tutorials/index.md
xiemingda 59ea23d0d3 [Doc] Add Single NPU (Qwen2.5-VL-7B) tutorial (#311)
Run vllm-ascend on Single NPU

What this PR does / why we need it?
Add vllm-ascend tutorial doc for Qwen/Qwen2.5-VL-7B-Instruct model
Inference/Serving doc

Does this PR introduce any user-facing change?
no

How was this patch tested?
no

Signed-off-by: xiemingda <xiemingda1002@gmail.com>
2025-03-12 20:37:12 +08:00

118 B

Tutorials

:::{toctree} :caption: Deployment :maxdepth: 1 single_npu single_npu_multimodal multi_npu multi_node :::