### What this PR does / why we need it?
This PR adds documentation for the Qwen3-VL-235B-A22B-Instruct model.
It introduces the model, lists the features supported in the current
version, describes the deployment process, and explains how to run
performance and accuracy tests.
The document is intended to make deploying and testing the
Qwen3-VL-235B-A22B-Instruct model easier.
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.12.0
- vLLM main: ad32e3e19c
Signed-off-by: luluxiu520 <l2625793@outlook.com>
# Tutorials
:::{toctree}
:caption: Deployment
:maxdepth: 1

Qwen2.5-Omni.md
Qwen2.5-7B
Qwen3-Dense
Qwen-VL-Dense.md
Qwen3-30B-A3B.md
Qwen3-235B-A22B.md
Qwen3-VL-235B-A22B-Instruct.md
Qwen3-Coder-30B-A3B
Qwen3_embedding
Qwen3_reranker
Qwen3-8B-W4A8
Qwen3-32B-W4A4
Qwen3-Next
DeepSeek-V3.1.md
DeepSeek-V3.2.md
DeepSeek-R1.md
Kimi-K2-Thinking
pd_disaggregation_mooncake_single_node
pd_disaggregation_mooncake_multi_node
ray
310p
:::