xc-llm-ascend

wangxiyuan 15b8aff582 [CI] Add max_split_size_mb for e2e test to avoid oom (#3252)

### What this PR does / why we need it?
We added a patch to the model weight loader to avoid using vLLM weight
loader v2, since v2 causes unknown issues with torchair. However, this
patch introduces a memory usage problem whose cause is not yet known.
As a quick fix, raise `max_split_size_mb` to a larger value to avoid
OOM during weight loading.

The longer-term solution is to remove the patch and address the weight
loader v2 issues from vLLM directly.
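
The `max_split_size_mb` workaround described above is an allocator setting, so it has to be in the environment before the test process initializes the device. The following is a minimal sketch, not the change made in this PR: it assumes the option is passed through the `PYTORCH_NPU_ALLOC_CONF` environment variable in the same `key:value,key:value` form that PyTorch uses for `PYTORCH_CUDA_ALLOC_CONF`. The helper names and the value `256` are hypothetical and chosen only for illustration.

```python
import os

# Assumption: torch_npu reads caching-allocator options from this variable,
# formatted like PYTORCH_CUDA_ALLOC_CONF ("key:value,key:value").
ALLOC_CONF_ENV = "PYTORCH_NPU_ALLOC_CONF"


def with_max_split_size(conf: str, size_mb: int) -> str:
    """Return `conf` with max_split_size_mb set to `size_mb`.

    Other options already present in `conf` are kept unchanged.
    """
    options = {}
    for item in conf.split(","):
        if ":" in item:
            key, value = item.split(":", 1)
            options[key.strip()] = value.strip()
    # Override (or add) the split-size limit.
    options["max_split_size_mb"] = str(size_mb)
    return ",".join(f"{key}:{value}" for key, value in options.items())


def apply_alloc_conf(size_mb: int, env=os.environ) -> str:
    """Write the updated allocator config into `env` and return it.

    Hypothetical helper: call it before torch / torch_npu is imported,
    because the allocator reads its configuration at initialization.
    """
    new_conf = with_max_split_size(env.get(ALLOC_CONF_ENV, ""), size_mb)
    env[ALLOC_CONF_ENV] = new_conf
    return new_conf


if __name__ == "__main__":
    # 256 is an illustrative value, not the one used by the e2e tests.
    print(apply_alloc_conf(256))
```

In a CI job the same effect is usually achieved by exporting the variable in the job environment, which avoids any ordering concerns with imports.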

Closes: https://github.com/vllm-project/vllm-ascend/issues/3251

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?

- vLLM version: v0.10.2
- vLLM main: https://github.com/vllm-project/vllm/commit/releases/v0.11.0

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>

2025-09-29 09:13:08 +08:00

e2e

[CI] Add max_split_size_mb for e2e test to avoid oom (#3252)

2025-09-29 09:13:08 +08:00

[CI][Bugfix] Quickfix for DPMetaData (#3234)

2025-09-28 21:11:22 +08:00

__init__.py

[SpecDecode] Add spec decode support (#500)

2025-04-17 20:16:32 +08:00