### What this PR does / why we need it?

This patch adds the `update_max_model_len` interface.

- vLLM version: v0.14.0
- vLLM main: d68209402d

---------

Signed-off-by: wangli <wangli858794774@gmail.com>
d68209402d
vllm-ascend/
vllm-ascend/compilation
dp
sp