wangxiyuan
bc69d7cfe1
upgrade to vllm 0.11.2 ( #4400 )
...
Bump vLLM version to v0.11.2
What's broken and changed by vLLM:
1. structured_output is broken by
https://github.com/vllm-project/vllm/pull/26866
2. get_mrope_input_positions is broken by
https://github.com/vllm-project/vllm/pull/28399
3. graph mode is broken by
https://github.com/vllm-project/vllm/pull/25110 we'll upgrade torch to
2.8 to fix the problem later
4. embedding is broken by
https://github.com/vllm-project/vllm/pull/27583
5. `get_attn_backend_cls` and attention backend is broken are broken by
https://github.com/vllm-project/vllm/pull/28534
6. spec decode is broken by
https://github.com/vllm-project/vllm/pull/28771
7. sp feature is broken by
https://github.com/vllm-project/vllm/pull/27126
8. mtp is broken by https://github.com/vllm-project/vllm/pull/27922
9. lora is broken by https://github.com/vllm-project/vllm/pull/21068
10. execute_model is broken by
https://github.com/vllm-project/vllm/pull/26866
11. `VLLM_DISABLE_SHARED_EXPERTS_STREAM` env is broken by
https://github.com/vllm-project/vllm/pull/28159
12. kv cahe is broken by https://github.com/vllm-project/vllm/pull/27753
13. dp is broken by https://github.com/vllm-project/vllm/pull/25110
What's broken and changed by ourself:
1. qwen vl is broken by https://github.com/vllm-project/vllm/pull/28455
We'll remove model files in the future to avoid this kind of error
2. Engine core is broken by
https://github.com/vllm-project/vllm/pull/23691 We'll remove the patch
file in the future.
3. Ascend scheduler is broken by
https://github.com/vllm-project/vllm/pull/28733 We'll remove ascend
scheudler later.
4. qwen3-next is broken by
https://github.com/vllm-project/vllm/pull/28083 We'll remove model files
in the future to avoid this kind of error
5. qwen vl is broken by https://github.com/vllm-project/vllm/pull/27764 .
We'll remove model files in the future
Known issue:
1. ray doesn't work
2. the accuracy of qwen3-next is not correct
3. qwen3-vl is broken
4. prefix cache+ ascend scheduler + deepseek v2 lite is broken.
Co-authored-by: MengqingCao <cmq0113@163.com >
Co-authored-by: hfadzxy <starmoon_zhang@163.com >
Co-authored-by: leo-pony <nengjunma@outlook.com >
Co-authored-by: 22dimensions <waitingwind@foxmail.com >
Co-authored-by: shen-shanshan <467638484@qq.com >
- vLLM version: v0.11.2
---------
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
Signed-off-by: MengqingCao <cmq0113@163.com >
Signed-off-by: hfadzxy <starmoon_zhang@163.com >
Signed-off-by: leo-pony <nengjunma@outlook.com >
Co-authored-by: MengqingCao <cmq0113@163.com >
Co-authored-by: hfadzxy <starmoon_zhang@163.com >
Co-authored-by: leo-pony <nengjunma@outlook.com >
2025-11-26 11:48:58 +08:00
wangxiyuan
fff258bce1
[Doc] add release note for v0.11.0rc2 ( #4348 )
...
add release note for v0.11.0rc2
- vLLM version: v0.11.0
- vLLM main:
2918c1b49c
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-11-21 23:03:32 +08:00
lilinsiman
adee9dd3b1
[Info][main] Correct the mistake in information documents ( #4157 )
...
### What this PR does / why we need it?
Correct the mistake in information documents
### Does this PR introduce _any_ user-facing change?
no
### How was this patch tested?
ut
- vLLM version: v0.11.0
- vLLM main:
2918c1b49c
---------
Signed-off-by: lilinsiman <lilinsiman@gmail.com >
2025-11-13 15:53:58 +08:00
22dimensions
c272747d13
Upgrade to 0.11.1 newest vllm commit ( #3982 )
...
### What this PR does / why we need it?
adapt vllm-ascend main branch with vllm releases/v0.11.1
fix `forward context not set` in test_vlm.py caused by:
https://github.com/vllm-project/vllm/pull/23207
fix import `cdiv round` failed caused by:
https://github.com/vllm-project/vllm/pull/27188
fix import `init_cached_hf_modules` failed caused by:
https://github.com/vllm-project/vllm/pull/27567
adapt triton kernel `fused_recurrent_gated_delta_rule_fwd_kernel` caused
by: https://github.com/vllm-project/vllm/pull/27654
- remove unused code in sigmoid_gating.py
- `class FusedRecurrentFunction` , `fused_recurrent_gated_delta_rule`,
`fused_recurrent_gated_delta_rule_fwd`
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
CI
- vLLM version: v0.11.0
- vLLM main:
83f478bb19
Signed-off-by: 22dimensions <waitingwind@foxmail.com >
2025-11-12 23:01:19 +08:00
wangxiyuan
64220c68c5
[Doc] Add release note for v0.11.0rc1 ( #3931 )
...
Add release note for v0.11.0rc1.
- vLLM version: v0.11.0
- vLLM main:
83f478bb19
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-11-10 21:01:50 +08:00
wangxiyuan
3ac76fdccc
[Doc] Update version policy ( #3999 )
...
Add version policy for main branch to clear how vllm-ascend work with
vllm
- vLLM version: v0.11.0
- vLLM main:
83f478bb19
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-11-05 14:55:54 +08:00
zhangxinyuehfad
789ba4c5c2
[Doc] Update doc ( #3836 )
...
### What this PR does / why we need it?
Update doc
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.11.0rc3
- vLLM main:
https://github.com/vllm-project/vllm/commit/releases/v0.11.1
Signed-off-by: hfadzxy <starmoon_zhang@163.com >
2025-10-29 11:03:39 +08:00
wangxiyuan
1a9feb3ba5
Update version doc ( #3599 )
...
1. Add v0.11.0-dev branch info
2. mark rfc/long_seq_optimization branch as completed
- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-10-25 09:37:56 +08:00
wangxiyuan
00ba071022
[Doc] Release note for v0.11.0rc0 ( #3224 )
...
### What this PR does / why we need it?
Add release note for v0.11.0rc0
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.11.0rc3
- vLLM main:
https://github.com/vllm-project/vllm/commit/releases/v0.11.0
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-09-30 03:26:18 +08:00
wangxiyuan
048bfd5553
[Release] Add release note for v0.10.2rc1 ( #2921 )
...
Add release note for v0.10.2rc1
- vLLM version: v0.10.2
- vLLM main:
b834b4cbf1
---------
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-09-16 01:20:05 +08:00
Mengqing Cao
7e16b4a7cd
[ReleaseNote] Add Release Note for v0.10.1rc1 ( #2635 )
...
Add Release Note for v0.10.1rc1
- vLLM version: v0.10.1.1
- vLLM main:
b5ee1e3261
---------
Signed-off-by: MengqingCao <cmq0113@163.com >
2025-09-04 11:26:47 +08:00
wangxiyuan
41b028aa5f
[Doc] add v0.9.1 release note ( #2646 )
...
Add release note for 0.9.1
- vLLM version: v0.10.1.1
- vLLM main:
8bd5844989
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-09-03 18:04:27 +08:00
Shanshan Shen
334c44613a
[Doc] Update release version info ( #2518 )
...
### What this PR does / why we need it?
Update release version info.
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.10.1.1
- vLLM main:
712d0f88d8
Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com >
2025-08-25 15:39:10 +08:00
Shanshan Shen
98c68220c1
[Doc] Update v0.9.1rc3 doc ( #2512 )
...
### What this PR does / why we need it?
Update `v0.9.1rc3` doc, which are supplements to
https://github.com/vllm-project/vllm-ascend/pull/2488 .
- vLLM version: v0.10.0
- vLLM main:
170e8ea9ea
Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com >
2025-08-25 11:39:29 +08:00
LookAround0301
e9fb895b10
[Doc] Add feature branch long_seq_optimization ( #2477 )
...
### What this PR does / why we need it?
Add cp/sp feature branch
- vLLM version: v0.10.0
- vLLM main:
0c6e40bbaa
Signed-off-by: LookAround <lixushi@huawei.com >
2025-08-22 08:53:12 +08:00
Yikun Jiang
67a222c383
[Doc] Add feature branch policy ( #2432 )
...
### What this PR does / why we need it?
This patch add the feature branch policy.
After this patch: maintainers are allowed to create a feature branch.
Feature branches are used for collaboration and must include an RFC
link, merge plan and mentor info.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
CI passed
- vLLM version: v0.10.0
- vLLM main:
7be5d113d8
Signed-off-by: Yikun Jiang <yikunkero@gmail.com >
2025-08-21 10:37:21 +08:00
Mengqing Cao
4604882a3e
[ReleaseNote] Release note of v0.10.0rc1 ( #2225 )
...
### What this PR does / why we need it?
Release note of v0.10.0rc1
- vLLM version: v0.10.0
- vLLM main:
8e8e0b6af1
---------
Signed-off-by: MengqingCao <cmq0113@163.com >
2025-08-07 14:46:49 +08:00
Yikun Jiang
54ace9e12b
Add release note for v0.9.1rc2 ( #2188 )
...
### What this PR does / why we need it?
Add release note for v0.9.1rc2
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
CI passed
- vLLM version: v0.10.0
- vLLM main:
c494f96fbc
Signed-off-by: Yikun Jiang <yikunkero@gmail.com >
2025-08-06 09:04:46 +08:00
wangxiyuan
9c560b009a
[Release] Add 0.9.2rc1 release note ( #1725 )
...
Add release note for 0.9.2rc1, we'll release soon
- vLLM version: v0.9.2
- vLLM main:
7bd4c37ae7
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-07-11 17:36:05 +08:00
Yikun Jiang
e4e9ea02ab
Upgrade vLLM version to v0.9.2 ( #1652 )
...
### What this PR does / why we need it?
This patch upgrade vLLM version to v0.9.2, this patch didn't remove the
v0.9.1 compatible code to easy review.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
- vLLM version: v0.9.1
- vLLM main:
14601f5fba
- Accuracy test with 0.9.2:
https://github.com/vllm-project/vllm-ascend/actions/runs/16121612087
Signed-off-by: Yikun Jiang <yikunkero@gmail.com >
2025-07-08 14:18:17 +08:00
wangxiyuan
e4e0b7af05
[Doc] Add patch doc ( #1414 )
...
1. Format the developer guide content to make it more clear
2. Add the patch doc for developer guide
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com >
2025-06-25 12:00:45 +08:00