### What this PR does / why we need it?
1. Add a nightly test for MiniMax-M2.5 using the deployment method on A3.
2. Add an introduction to MiniMax-M2.5 deployment to the vllm-ascend docs.
- vLLM version: v0.17.0
- vLLM main: 4034c3d32e
---------
Signed-off-by: limuyuan <limuyuan3@huawei.com>
Signed-off-by: SparrowMu <52023119+SparrowMu@users.noreply.github.com>
Co-authored-by: limuyuan <limuyuan3@huawei.com>
# Model Tutorials
This section provides tutorials for running different models on vLLM Ascend.
:::{toctree}
:caption: Model Tutorials
:maxdepth: 1
Qwen2.5-Omni.md
Qwen2.5-7B.md
Qwen3-Dense.md
Qwen-VL-Dense.md
Qwen3-30B-A3B.md
Qwen3-235B-A22B.md
Qwen3-VL-30B-A3B-Instruct.md
Qwen3-VL-235B-A22B-Instruct.md
Qwen3-Coder-30B-A3B.md
Qwen3_embedding.md
Qwen3-VL-Embedding.md
Qwen3_reranker.md
Qwen3-VL-Reranker.md
Qwen3-8B-W4A8.md
Qwen3-32B-W4A4.md
Qwen3-Next.md
Qwen3-Omni-30B-A3B-Thinking.md
Qwen3.5-27B.md
Qwen3.5-397B-A17B.md
DeepSeek-V3.1.md
DeepSeek-V3.2.md
DeepSeek-R1.md
GLM4.x.md
GLM5.md
Kimi-K2-Thinking.md
Kimi-K2.5.md
PaddleOCR-VL.md
MiniMax-M2.5.md
:::