xc-llm-ascend/vllm_ascend
NeverRaR 807686dec9 perf : optimize memory for deepseek mtp (#2713)
### What this PR does / why we need it?
Delete the temporary tensor to reduce memory usage for DeepSeek MTP in the torchair case.
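
As an illustration of the general idea only (this is not the code changed in this PR), the sketch below shows how dropping the last reference to a temporary buffer lets it be reclaimed before the function returns. `FakeTensor` and `forward_step` are hypothetical names, and a plain `bytearray` stands in for a device tensor.

```python
import gc
import weakref


class FakeTensor:
    """Hypothetical stand-in for a device tensor; holds a plain byte buffer."""

    def __init__(self, nbytes: int) -> None:
        self.data = bytearray(nbytes)


def forward_step(hidden: FakeTensor):
    # Temporary buffer that is only needed to produce `out`.
    tmp = FakeTensor(len(hidden.data))
    tmp_ref = weakref.ref(tmp)  # lets us observe when tmp is reclaimed

    out = FakeTensor(16)
    out.data[:] = tmp.data[:16]  # slicing copies, so `out` holds no reference to tmp

    # Drop the last reference as soon as the temporary is no longer needed,
    # instead of keeping it alive until the function returns.
    del tmp
    gc.collect()

    freed_early = tmp_ref() is None
    return out, freed_early
```

Calling `forward_step(FakeTensor(1024))` returns the output together with a flag indicating whether the temporary was already released at that point. With real tensors the same pattern is typically used to lower peak memory, assuming no other reference to the temporary remains.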

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: boying <897013703@qq.com>
2025-10-23 15:52:17 +08:00