[Version] Drop 0.16.0 support (#7153)
### What this PR does / why we need it?
Drop vLLM v0.16.0 support on main.
- Fix the eagle proposer breakage introduced by
https://github.com/vllm-project/vllm/pull/34552. The main change is to use
the draft attention group to initialize the attention metadata builder.
- Fix the "`ModelRunner` has no attribute `cudagraph_capture_sizes`"
error, a bug in vLLM v0.17.0 that was fixed by a later PR,
https://github.com/vllm-project/vllm/pull/30515.
- vLLM version: v0.16.0
- vLLM main:
4034c3d32e
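
To illustrate the kind of failure behind the `cudagraph_capture_sizes` item, here is a minimal sketch of guarding against a missing attribute with `getattr`. It is not the change made in this PR: the helper `get_capture_sizes` and the stand-in runner objects are hypothetical names used only for illustration, and no real vLLM classes are involved.

```python
from types import SimpleNamespace


def get_capture_sizes(runner, default=()):
    """Return the runner's capture sizes, or `default` if the attribute is absent.

    Hypothetical helper: `runner` stands in for a ModelRunner-like object.
    """
    sizes = getattr(runner, "cudagraph_capture_sizes", None)
    return list(default) if sizes is None else list(sizes)


# Stand-in objects, not real vLLM classes.
runner_without_attr = SimpleNamespace()
runner_with_attr = SimpleNamespace(cudagraph_capture_sizes=[1, 2, 4])

print(get_capture_sizes(runner_without_attr, default=[1]))
print(get_capture_sizes(runner_with_attr))
```

A direct `runner.cudagraph_capture_sizes` access would raise `AttributeError` on the first object, while the guarded lookup falls back to the supplied default.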
---------
Signed-off-by: MengqingCao <cmq0113@163.com>
@@ -39,7 +39,7 @@ on:
 vllm_version:
   required: false
   type: string
-  default: "v0.16.0"
+  default: "v0.17.0"
 is_pr_test:
   required: true
   type: boolean