xc-llm-ascend/index.md at 57a84bb7befeaa0dc62aa35fa406e4d6affbfcca - xc-llm-ascend - Gitea: Git with a cup of tea

EngineX/xc-llm-ascend

Files

xiemingda 59ea23d0d3 [Doc] Add Single NPU (Qwen2.5-VL-7B) tutorial (#311 )

Run vllm-ascend on Single NPU

What this PR does / why we need it?
Add vllm-ascend tutorial doc for Qwen/Qwen2.5-VL-7B-Instruct model
Inference/Serving doc

Does this PR introduce any user-facing change?
no

How was this patch tested?
no

Signed-off-by: xiemingda <xiemingda1002@gmail.com>

2025-03-12 20:37:12 +08:00

11 lines

118 B

Markdown

Raw Blame History

 # Tutorials
 :::{toctree}
 :caption: Deployment
 :maxdepth: 1
 single_npu
 single_npu_multimodal
 multi_npu
 multi_node
 :::