[Version] Drop 0.16.0 support (#7153)
### What this PR does / why we need it?
Drop 0.16.0 support in main
- Fix the eagle proposer breakage introduced by
https://github.com/vllm-project/vllm/pull/34552. The main change is to use
the draft attention group to initialize the attention metadata builder.
- Fix the `ModelRunner` has no attribute `cudagraph_capture_sizes`
error, a bug in vLLM v0.17.0 that was fixed by a later PR,
https://github.com/vllm-project/vllm/pull/30515
- vLLM version: v0.16.0
- vLLM main:
4034c3d32e
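
For illustration only, the kind of `AttributeError` described in the second fix can be guarded against with a defensive attribute lookup. This is a minimal sketch, not the patch in this PR or in vLLM: `LegacyRunner`, `PatchedRunner`, and `capture_sizes` are hypothetical names, and only the attribute name `cudagraph_capture_sizes` comes from the error message above.

```python
from typing import Iterable, List


class LegacyRunner:
    """Hypothetical runner that never sets cudagraph_capture_sizes."""


class PatchedRunner:
    """Hypothetical runner that exposes the attribute."""

    def __init__(self, sizes: Iterable[int]) -> None:
        self.cudagraph_capture_sizes = list(sizes)


def capture_sizes(runner: object) -> List[int]:
    # Fall back to an empty list instead of raising AttributeError
    # when the attribute is missing (or set to None).
    return list(getattr(runner, "cudagraph_capture_sizes", None) or [])
```

With this guard, a runner that lacks the attribute yields an empty list rather than an error, which keeps callers working across versions that do and do not define it.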
---------
Signed-off-by: MengqingCao <cmq0113@163.com>
@@ -75,9 +75,9 @@ myst_substitutions = {
     "pip_vllm_ascend_version": "0.16.0rc1",
     "pip_vllm_version": "0.16.0",
     # CANN image tag
-    "cann_image_tag": "8.5.0-910b-ubuntu22.04-py3.11",
+    "cann_image_tag": "8.5.1-910b-ubuntu22.04-py3.11",
     # vllm version in ci
-    "ci_vllm_version": "v0.16.0",
+    "ci_vllm_version": "v0.17.0",
 }
 
 # For cross-file header anchors