xc-llm-ascend/docs/source/tutorials/index.md
SILONG ZENG ff7d703192 [Doc]Add tutorial document for qwen-VL-Dense (#3516)
### What this PR does / why we need it?
This document uses the qwen3-vl-8b and qwen2.5-vl-32b models to
demonstrate the main verification steps for the Qwen-VL series of dense
models: supported features, feature configuration, environment
preparation, NPU deployment, and accuracy and performance evaluation.

- vLLM version: v0.12.0
- vLLM main: ad32e3e19c

---------

Signed-off-by: MrZ20 <2609716663@qq.com>
2025-12-11 08:55:23 +08:00

# Tutorials

:::{toctree}
:caption: Deployment
:maxdepth: 1

single_npu
Qwen-VL-Dense.md
single_npu_qwen2_audio
single_npu_qwen3_embedding
single_npu_qwen3_quantization
single_npu_qwen3_w4a4
single_node_pd_disaggregation_mooncake
multi_npu_qwen3_next
multi_npu
multi_npu_kimi-k2-thinking
multi_npu_moge
Qwen3-Dense
multi_npu_qwen3_moe
multi_npu_quantization
single_node_300i
DeepSeek-R1.md
DeepSeek-V3.1.md
DeepSeek-V3.2-Exp.md
Qwen3-235B-A22B.md
Qwen3-Coder-30B-A3B
multi_node
multi_node_kimi
multi_node_qwen3vl
multi_node_pd_disaggregation_mooncake
multi_node_ray
Qwen2.5-Omni.md
:::