xc-llm-ascend

Files

lidenghui1110 481138e1d2 [bugfix] adapt to new implemented get_kv_cache_spec in cpuoffload connector (#4311 )

### What this PR does / why we need it?
func `get_kv_cache_spec` in model_runner changed a lot and caused error
in cpuoffloading connector which is copied from model_runner, this PR
adapts to new implemented `get_kv_cache_spec` to fix it.

### How was this patch tested?

- vLLM version: v0.11.0
- vLLM main:
2918c1b49c

Signed-off-by: lidenghui <lidenghui1110@gmail.com>

2026-01-08 09:15:09 +08:00

__init__.py

[Feature]cpu offload connector (#1659 )

2025-09-23 14:25:05 +08:00

cpu_kv_cache_manager.py

upgrade vLLM to main (#4608 )

2025-12-02 22:10:52 +08:00

metadata.py

[bugfix] adapt to new implemented get_kv_cache_spec in cpuoffload connector (#4311 )

2026-01-08 09:15:09 +08:00