xc-llm-ascend

Author SHA1 Message Date


Shanshan Shen

aeb5aa8b88

[Misc][V0 Deprecation] Add `__main__` guard to all offline examples (#1837)

### What this PR does / why we need it?
Add `__main__` guard to all offline examples.

- vLLM version: v0.9.2
- vLLM main: `76b494444f`

---------

Signed-off-by: shen-shanshan <467638484@qq.com>

2025-07-17 14:13:30 +08:00
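
To illustrate the `__main__` guard that commit aeb5aa8b88 adds, here is a minimal sketch of the pattern. The function names `build_prompts` and `main` are hypothetical, and the actual vLLM calls are left out so the snippet runs without vLLM installed; the real examples in the repository may be structured differently. The guard matters because process start methods that re-import the entry script (such as `spawn`) would otherwise re-run module-level code in every worker.

```python
def build_prompts() -> list[str]:
    # Hypothetical helper: the prompts an offline example might send to the model.
    return ["Hello, my name is", "The future of AI is"]


def main() -> int:
    prompts = build_prompts()
    # Assumption: in a real offline example, model construction and
    # generation would happen here instead of printing the prompts.
    for prompt in prompts:
        print(f"Prompt: {prompt!r}")
    return len(prompts)


# Only run the example when the file is executed as a script,
# not when it is imported (for instance by a spawned worker process).
if __name__ == "__main__":
    main()
```

Keeping all work inside `main()` means importing the file has no side effects, so it is safe for child processes to import it.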

zhuo97

f5404dc650

Fix the device error when using Ray as vllm-ascend backend (#884)

1. Remove RAY_EXPERIMENTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES
2. Add lazy init for vllm_ascend_C

Signed-off-by: zhuo97 <1103045176@qq.com>

2025-06-16 21:03:16 +08:00

Wan_Danfeng

5cf9ff18e9

[Performance]: Custom AscendC Kernel of Multi-Step Prepare Input (#814)

### What this PR does / why we need it?

- Following https://github.com/vllm-project/vllm-ascend/issues/807,
this PR adds a custom AscendC kernel for multi-step input preparation.
- It also fixes a bug in multi_step_runner.py that we found when using
multi-step on the V0 Engine.


### Does this PR introduce _any_ user-facing change?

No user-facing change.


### How was this patch tested?
This PR adds a unit test file and an offline inference example to test
the custom AscendC kernel. See test/ops/test_multi_step.py and
examples/offline_multi_step.py.

---------

Signed-off-by: wan_danfeng <wonderful199082@126.com>

2025-05-20 09:31:30 +08:00

3 Commits