xc-llm-ascend/docs/source/tutorials/index.md
luluxiu520 bc05a81bf2 Add Qwen3-VL-235B-A22B-Instruct tutorials (#5167)
### What this PR does / why we need it?

This PR adds a tutorial for the Qwen3-VL-235B-A22B-Instruct model. It
introduces the model, lists the features supported in the current
version, walks through the deployment process, and describes how to run
performance and accuracy tests.

The tutorial makes the Qwen3-VL-235B-A22B-Instruct model easier to
deploy and test.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.12.0
- vLLM main: ad32e3e19c

Signed-off-by: luluxiu520 <l2625793@outlook.com>
2025-12-19 14:56:17 +08:00

# Tutorials

:::{toctree}
:caption: Deployment
:maxdepth: 1
Qwen2.5-Omni.md
Qwen2.5-7B
Qwen3-Dense
Qwen-VL-Dense.md
Qwen3-30B-A3B.md
Qwen3-235B-A22B.md
Qwen3-VL-235B-A22B-Instruct.md
Qwen3-Coder-30B-A3B
Qwen3_embedding
Qwen3_reranker
Qwen3-8B-W4A8
Qwen3-32B-W4A4
Qwen3-Next
DeepSeek-V3.1.md
DeepSeek-V3.2.md
DeepSeek-R1.md
Kimi-K2-Thinking
pd_disaggregation_mooncake_single_node
pd_disaggregation_mooncake_multi_node
ray
310p
:::