### What this PR does / why we need it? Modify the recalculation logic to prevent waiting requests from filling up the D node KVCache - vLLM version: v0.11.0rc3 - vLLM main: 17c540a993 Signed-off-by: underfituu <hzhucong@163.com>
17c540a993