From 291c00a224ac12d1d26c4586f8fe37b5377f21a6 Mon Sep 17 00:00:00 2001
From: leo-pony
Date: Thu, 16 Oct 2025 08:54:09 +0800
Subject: [PATCH] [Doc] pin version that can stable running 310I Duo to vllm-ascend v0.10.0rc1 (#3455)

Pin version that can stable running 310I Duo to vllm-ascend v0.10.0rc1.

### What this PR does / why we need it?
Since PR #2614 310I Duo been broken. Although we are currently working
on fixing the issue with the 310I Duo being broken, there is no
confirmed timeline for a fix in the short term. To allow users to
quickly find a working version instead of going back and forth on trial
and error, this PR fixes the version in the 310I Duo guide.

### Does this PR introduce _any_ user-facing change?
NA

### How was this patch tested?
NA

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: leo-pony
---
 docs/source/faqs.md                       | 2 +-
 docs/source/tutorials/single_node_300i.md | 4 +++-
 2 files changed, 4 insertions(+), 2 deletions(-)

diff --git a/docs/source/faqs.md b/docs/source/faqs.md
index ec7f339..8bcb81e 100644
--- a/docs/source/faqs.md
+++ b/docs/source/faqs.md
@@ -15,7 +15,7 @@ Currently, **ONLY** Atlas A2 series(Ascend-cann-kernels-910b),Atlas A3 series(
 - Atlas 800I A2 Inference series (Atlas 800I A2)
 - Atlas A3 Training series (Atlas 800T A3, Atlas 900 A3 SuperPoD, Atlas 9000 A3 SuperPoD)
 - Atlas 800I A3 Inference series (Atlas 800I A3)
-- [Experimental] Atlas 300I Inference series (Atlas 300I Duo)
+- [Experimental] Atlas 300I Inference series (Atlas 300I Duo). Currently for 310I Duo the stable version is vllm-ascend v0.10.0rc1.
 
 Below series are NOT supported yet:
 - Atlas 200I A2 (Ascend-cann-kernels-310b) unplanned yet
diff --git a/docs/source/tutorials/single_node_300i.md b/docs/source/tutorials/single_node_300i.md
index 270d002..4109495 100644
--- a/docs/source/tutorials/single_node_300i.md
+++ b/docs/source/tutorials/single_node_300i.md
@@ -3,6 +3,8 @@
 ```{note}
 1. This Atlas 300I series is currently experimental. In future versions, there may be behavioral changes around model coverage, performance improvement.
 2. Currently, the 310I series only supports eager mode and the data type is float16.
+3. There are some known issues for running vLLM on 310p series, you can refer to vllm-ascend [#3316](https://github.com/vllm-project/vllm-ascend/issues/3316),
+ [#2795](https://github.com/vllm-project/vllm-ascend/issues/2795), you can use v0.10.0rc1 version first.
 ```
 
 ## Run vLLM on Altlas 300I series
@@ -12,7 +14,7 @@ Run docker container:
 ```{code-block} bash
 :substitutions:
 # Update the vllm-ascend image
-export IMAGE=quay.io/ascend/vllm-ascend:|vllm_ascend_version|-310p
+export IMAGE=quay.io/ascend/vllm-ascend:v0.10.0rc1-310p
 docker run --rm \
 --name vllm-ascend \
 --device /dev/davinci0 \