diff --git a/.github/ISSUE_TEMPLATE/900-release-checklist.yml b/.github/ISSUE_TEMPLATE/900-release-checklist.yml index 97fc3f7..9c13598 100644 --- a/.github/ISSUE_TEMPLATE/900-release-checklist.yml +++ b/.github/ISSUE_TEMPLATE/900-release-checklist.yml @@ -24,6 +24,8 @@ body: value: > - [ ] Create a new issue for release feedback + - [ ] Upgrade vllm version to the new version for CI and Dockerfile + - [ ] Write the release note PR. - [ ] Update the feedback issue link in docs/source/faqs.md diff --git a/README.md b/README.md index 6fd02ec..9c255b1 100644 --- a/README.md +++ b/README.md @@ -52,7 +52,7 @@ Please use the following recommended versions to get started quickly: | Version | Release type | Doc | |------------|--------------|--------------------------------------| -|v0.10.2rc1|Latest release candidate|[QuickStart](https://vllm-ascend.readthedocs.io/en/latest/quick_start.html) and [Installation](https://vllm-ascend.readthedocs.io/en/latest/installation.html) for more details| +|v0.11.0rc0|Latest release candidate|[QuickStart](https://vllm-ascend.readthedocs.io/en/latest/quick_start.html) and [Installation](https://vllm-ascend.readthedocs.io/en/latest/installation.html) for more details| |v0.9.1|Latest stable version|[QuickStart](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/quick_start.html) and [Installation](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/installation.html) for more details| ## Contributing @@ -73,7 +73,7 @@ Below is maintained branches: | Branch | Status | Note | |------------|--------------|--------------------------------------| -| main | Maintained | CI commitment for vLLM main branch and vLLM v0.10.2 tag | +| main | Maintained | CI commitment for vLLM main branch and vLLM v0.11.0 tag | | v0.7.1-dev | Unmaintained | Only doc fixed is allowed | | v0.7.3-dev | Maintained | CI commitment for vLLM 0.7.3 version, only bug fix is allowed and no new release tag any more. | | v0.9.1-dev | Maintained | CI commitment for vLLM 0.9.1 version | diff --git a/README.zh.md b/README.zh.md index e118b6b..bb7ddb9 100644 --- a/README.zh.md +++ b/README.zh.md @@ -53,7 +53,7 @@ vLLM 昇腾插件 (`vllm-ascend`) 是一个由社区维护的让vLLM在Ascend NP | Version | Release type | Doc | |------------|--------------|--------------------------------------| -|v0.10.2rc1| 最新RC版本 |请查看[快速开始](https://vllm-ascend.readthedocs.io/en/latest/quick_start.html)和[安装指南](https://vllm-ascend.readthedocs.io/en/latest/installation.html)了解更多| +|v0.11.0rc0| 最新RC版本 |请查看[快速开始](https://vllm-ascend.readthedocs.io/en/latest/quick_start.html)和[安装指南](https://vllm-ascend.readthedocs.io/en/latest/installation.html)了解更多| |v0.9.1| 最新正式/稳定版本 |[快速开始](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/quick_start.html) and [安装指南](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/installation.html)了解更多| ## 贡献 @@ -73,7 +73,7 @@ vllm-ascend有主干分支和开发分支。 | 分支 | 状态 | 备注 | |------------|------------|---------------------| -| main | Maintained | 基于vLLM main分支CI看护 | +| main | Maintained | 基于vLLM main分支和vLLM最新版本(v0.11.0)CI看护 | | v0.7.1-dev | Unmaintained | 只允许文档修复 | | v0.7.3-dev | Maintained | 基于vLLM v0.7.3版本CI看护, 只允许Bug修复,不会再发布新版本 | | v0.9.1-dev | Maintained | 基于vLLM v0.9.1版本CI看护 | diff --git a/docs/source/community/versioning_policy.md b/docs/source/community/versioning_policy.md index 96fcc26..8465570 100644 --- a/docs/source/community/versioning_policy.md +++ b/docs/source/community/versioning_policy.md @@ -22,6 +22,7 @@ Following is the Release Compatibility Matrix for vLLM Ascend Plugin: | vLLM Ascend | vLLM | Python | Stable CANN | PyTorch/torch_npu | MindIE Turbo | |-------------|--------------|------------------|-------------|--------------------|--------------| +| v0.11.0rc0 | v0.11.0rc3 | >= 3.9, < 3.12 | 8.2.RC1 | 2.7.1 / 2.7.1.dev20250724 | | | v0.10.2rc1 | v0.10.2 | >= 3.9, < 3.12 | 8.2.RC1 | 2.7.1 / 2.7.1.dev20250724 | | | v0.10.1rc1 | v0.10.1/v0.10.1.1 | >= 3.9, < 3.12 | 8.2.RC1 | 2.7.1 / 2.7.1.dev20250724 | | | v0.10.0rc1 | v0.10.0 | >= 3.9, < 3.12 | 8.2.RC1 | 2.7.1 / 2.7.1.dev20250724 | | @@ -43,6 +44,7 @@ Following is the Release Compatibility Matrix for vLLM Ascend Plugin: | Date | Event | |------------|-------------------------------------------| +| 2025.09.30 | Release candidates, v0.11.0rc0 | | 2025.09.16 | Release candidates, v0.10.2rc1 | | 2025.09.04 | Release candidates, v0.10.1rc1 | | 2025.09.03 | v0.9.1 Final release | diff --git a/docs/source/conf.py b/docs/source/conf.py index 6444665..d864a3b 100644 --- a/docs/source/conf.py +++ b/docs/source/conf.py @@ -65,15 +65,15 @@ myst_substitutions = { # the branch of vllm, used in vllm clone # - main branch: 'main' # - vX.Y.Z branch: 'vX.Y.Z' - 'vllm_version': 'v0.10.2', + 'vllm_version': 'v0.11.0rc3', # the branch of vllm-ascend, used in vllm-ascend clone and image tag # - main branch: 'main' # - vX.Y.Z branch: latest vllm-ascend release tag - 'vllm_ascend_version': 'v0.10.2rc1', + 'vllm_ascend_version': 'v0.11.0rc0', # the newest release version of vllm-ascend and matched vLLM, used in pip install. # This value should be updated when cut down release. - 'pip_vllm_ascend_version': "0.10.2rc1", - 'pip_vllm_version': "0.10.2", + 'pip_vllm_ascend_version': "0.11.0rc0", + 'pip_vllm_version': "0.11.0", # CANN image tag 'cann_image_tag': "8.2.rc1-910b-ubuntu22.04-py3.11", # vllm version in ci diff --git a/docs/source/faqs.md b/docs/source/faqs.md index bd2b479..ec7f339 100644 --- a/docs/source/faqs.md +++ b/docs/source/faqs.md @@ -3,7 +3,7 @@ ## Version Specific FAQs - [[v0.9.1] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/2643) -- [[v0.10.2rc1] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/2874) +- [[v0.11.0rc1] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/3222) ## General FAQs diff --git a/docs/source/user_guide/release_notes.md b/docs/source/user_guide/release_notes.md index cd58a7b..3a0bcca 100644 --- a/docs/source/user_guide/release_notes.md +++ b/docs/source/user_guide/release_notes.md @@ -1,5 +1,29 @@ # Release note +## v0.11.0rc0 - 2025.09.30 + +This is the special release candidate of v0.11.0 for vLLM Ascend. Please follow the [official doc](https://vllm-ascend.readthedocs.io/en/) to get started. + +### Highlights + +- DeepSeek V3.2 is supported now. [#3270](https://github.com/vllm-project/vllm-ascend/pull/3270) +- Qwen3-vl is supported now. [#3103](https://github.com/vllm-project/vllm-ascend/pull/3103) + +### Core + +- DeepSeek works with aclgraph now. [#2707](https://github.com/vllm-project/vllm-ascend/pull/2707) +- MTP works with aclgraph now. [#2932](https://github.com/vllm-project/vllm-ascend/pull/2932) +- EPLB is supported now. [#2956](https://github.com/vllm-project/vllm-ascend/pull/2956) +- Mooncacke store kvcache connector is supported now. [#2913](https://github.com/vllm-project/vllm-ascend/pull/2913) +- CPU offload connector is supported now. [#1659](https://github.com/vllm-project/vllm-ascend/pull/1659) + +### Other + +- Qwen3-next is stable now. [#3007](https://github.com/vllm-project/vllm-ascend/pull/3007) +- Fixed a lot of bugs introduced in v0.10.2 by Qwen3-next. [#2964](https://github.com/vllm-project/vllm-ascend/pull/2964) [#2781](https://github.com/vllm-project/vllm-ascend/pull/2781) [#3070](https://github.com/vllm-project/vllm-ascend/pull/3070) [#3113](https://github.com/vllm-project/vllm-ascend/pull/3113) +- The LoRA feature is back now. [#3044](https://github.com/vllm-project/vllm-ascend/pull/3044) +- Eagle3 spec decode method is back now. [#2949](https://github.com/vllm-project/vllm-ascend/pull/2949) + ## v0.10.2rc1 - 2025.09.16 This is the 1st release candidate of v0.10.2 for vLLM Ascend. Please follow the [official doc](https://vllm-ascend.readthedocs.io/en/) to get started. diff --git a/vllm_ascend/platform.py b/vllm_ascend/platform.py index c1bf203..f1581df 100644 --- a/vllm_ascend/platform.py +++ b/vllm_ascend/platform.py @@ -212,7 +212,7 @@ class NPUPlatform(Platform): vllm_config._set_cudagraph_sizes() # TODO: Full graph is fully supported later, and the default value will be set to full graph. - if not vllm_version_is("v0.10.2"): + if not vllm_version_is("0.10.2"): if compilation_config.cudagraph_mode == CUDAGraphMode.FULL_AND_PIECEWISE: compilation_config.cudagraph_mode = CUDAGraphMode.PIECEWISE