### What this PR does / why we need it?
Fixes https://github.com/vllm-project/vllm-ascend/issues/5201
### Does this PR introduce _any_ user-facing change?
No. This is a documentation-only change.
### How was this patch tested?
- vLLM version: release/v0.13.0
- vLLM main: ad32e3e19c
Signed-off-by: rongfu.leng <lenronfu@gmail.com>