xc-llm-ascend

Li Wang 01592515b8 [Bugfix] Fix sleep mode level 2 (#1376)

### What this PR does / why we need it?
For sleep mode level 2, we discard both the model weights and the kv_cache.
The problem is that discarding the weights also discards the tensors
exposed by `model.named_buffers()`, which hold model state such as
`running_mean` / `running_var` in BatchNorm and the RoPE cos-sin cache.
If we later update the weights but forget to update the buffers as well,
the buffers are not restored, which can lead to unexpected issues.
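
The distinction between parameters and buffers can be illustrated with a minimal PyTorch sketch. This is not the code from this patch: `snapshot_buffers` and `restore_buffers` are hypothetical helper names, and zeroing the buffers only stands in for the memory discard that sleep mode level 2 performs.

```python
import torch
from torch import nn


def snapshot_buffers(model: nn.Module) -> dict[str, torch.Tensor]:
    """Hypothetical helper: copy every registered buffer to CPU."""
    return {
        name: buf.detach().cpu().clone()
        for name, buf in model.named_buffers()
    }


def restore_buffers(model: nn.Module, saved: dict[str, torch.Tensor]) -> None:
    """Hypothetical helper: copy saved values back into the buffers in place."""
    with torch.no_grad():
        for name, buf in model.named_buffers():
            if name in saved:
                buf.copy_(saved[name])


# BatchNorm keeps running statistics as buffers, not as parameters.
model = nn.BatchNorm1d(4)
model.train()
model(torch.randn(8, 4))  # one training step updates running_mean / running_var

saved = snapshot_buffers(model)

# Stand-in for discarding device memory: wipe the buffer contents.
with torch.no_grad():
    for _, buf in model.named_buffers():
        buf.zero_()

# Reloading weights alone would not touch these tensors, so restore them too.
restore_buffers(model, saved)
```

Because `named_parameters()` does not include `running_mean`, `running_var`, or `num_batches_tracked`, a reload that iterates only over parameters leaves these tensors in whatever state the discard left them.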
### Does this PR introduce _any_ user-facing change?

### How was this patch tested?


- vLLM version: v0.10.2
- vLLM main: 5963b98b46

---------

Signed-off-by: wangli <wangli858794774@gmail.com>

2025-09-18 19:51:52 +08:00

e2e

[Feat][Graph] Support MTP for ACL Graph (#2932)

2025-09-18 14:05:33 +08:00

[Bugfix] Fix sleep mode level 2 (#1376)

2025-09-18 19:51:52 +08:00

__init__.py

[SpecDecode] Add spec decode support (#500)

2025-04-17 20:16:32 +08:00