xc-llm-ascend

Files

whx 98cadc2146 [Perf] Avoid performing index selection of sin/cos cache every layer (#1890 )

Optimize number of index selections of sin/cos cache.

- vLLM version: v0.10.0
- vLLM main:
656c24f1b5

Signed-off-by: whx-sjtu <2952154980@qq.com>

2025-07-29 18:06:45 +08:00

__init__.py

2025-07-19 09:42:32 +08:00

eagle_proposer_v1.py

2025-07-28 15:59:09 +08:00

model_runner_v1.py

2025-07-29 18:06:45 +08:00

mtp_proposer_v1.py

2025-07-28 14:06:20 +08:00

npu_input_batch.py

2025-07-26 15:43:29 +08:00

worker_v1.py

2025-07-28 14:06:20 +08:00