xc-llm-ascend/vllm_ascend
NeverRaR df84cceca8 perf: use multicast to avoid padding decode request to prefill size (#1555)
### What this PR does / why we need it?
perf: use multicast to avoid padding decode requests to the prefill size

### How was this patch tested?

- vLLM version: v0.9.1
- vLLM main: 1fd471e957

Signed-off-by: boying <897013703@qq.com>
2025-07-07 22:36:03 +08:00