xc-llm-ascend

rjg-lyh 2bfbf9b9b3 [main][bugfix] Fix bugs and refactor cached mask generation logic (#2442)

### What this PR does / why we need it?
This PR fixes bugs and refactors the cached mask generation logic. The
cached mask is now pre-constructed and used on the CPU instead of on the NPU device.

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
CI passed with newly added and existing tests.

- vLLM version: v0.10.1.1
- vLLM main: 9b5f64238f

Signed-off-by: rjg-lyh <1318825571@qq.com>

2025-08-27 12:07:29 +08:00

e2e

[Embedding] Recover embedding function (#2483)

2025-08-27 09:22:01 +08:00

[main][bugfix] Fix bugs and refactor cached mask generation logic (#2442)

2025-08-27 12:07:29 +08:00

__init__.py

[SpecDecode] Add spec decode support (#500)

2025-04-17 20:16:32 +08:00