xc-llm-ascend

Author SHA1 Message Date


Shaoxu Cheng

e0e585a109

[310P]: add torch chunk gated delta rule and 910b parity ut (#7594)

### What this PR does / why we need it?
RFC https://github.com/vllm-project/vllm-ascend/issues/7394
Add a PyTorch implementation of the chunk gated delta rule on 310P.
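
For orientation, the recurrence that a chunked gated delta rule kernel is usually checked against can be sketched token by token. This is a minimal plain-Python sketch, not the code from this PR: the function name, the argument layout (one head, lists of per-token vectors), and the use of a direct decay factor `alpha` rather than a log-space gate are all assumptions made for illustration, and no query scaling is applied.

```python
def gated_delta_rule_recurrent(q, k, v, alpha, beta):
    """Naive per-token gated delta rule for a single head (hypothetical helper).

    q, k: lists of length-d_k vectors; v: list of length-d_v vectors.
    alpha: per-token decay in (0, 1]; beta: per-token write strength.
    Update: S_t = alpha_t * S_{t-1} + beta_t * (v_t - alpha_t * S_{t-1} k_t) k_t^T
    Output: o_t = S_t q_t
    """
    d_k, d_v = len(k[0]), len(v[0])
    # State matrix S has shape (d_v, d_k) and starts at zero.
    S = [[0.0] * d_k for _ in range(d_v)]
    outputs = []
    for q_t, k_t, v_t, a_t, b_t in zip(q, k, v, alpha, beta):
        # Gate: decay the whole memory.
        S = [[a_t * s for s in row] for row in S]
        # What the decayed memory currently predicts for this key.
        pred = [sum(s * kj for s, kj in zip(row, k_t)) for row in S]
        # Delta rule: write only the prediction error, scaled by beta.
        for i in range(d_v):
            err = b_t * (v_t[i] - pred[i])
            for j in range(d_k):
                S[i][j] += err * k_t[j]
        # Read out with the query.
        outputs.append([sum(s * qj for s, qj in zip(row, q_t)) for row in S])
    return outputs
```

A chunked implementation computes the same outputs but processes blocks of tokens with matrix products; a parity unit test would compare the two within a numerical tolerance.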
### Does this PR introduce _any_ user-facing change?
NO
### How was this patch tested?
UT

---------

Signed-off-by: Tflowers-0129 <2906339855@qq.com>

2026-03-25 16:46:43 +08:00

Shaoxu Cheng

83bd77c983

[310p]: add rmsnorm gated fallback and unit test (#7424)

### What this PR does / why we need it?
RFC #7394
310P cannot use the fused `rmsnormgated` operator and must fall back to
the native implementation.
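
As a rough picture of what an unfused fallback computes, here is a minimal plain-Python sketch of a gated RMSNorm over one vector. It is not the code from this PR: the function names, the `norm_before_gate` switch, and the choice of SiLU as the gate activation are assumptions based on the common formulation of this layer.

```python
import math

def silu(value):
    # SiLU (swish) activation: value * sigmoid(value).
    return value / (1.0 + math.exp(-value))

def rms_norm_gated(x, z, weight, eps=1e-6, norm_before_gate=True):
    """Gated RMSNorm over a single vector (hypothetical reference helper).

    x: input vector, z: gate vector, weight: learned scale, all same length.
    norm_before_gate=True  -> rmsnorm(x) * weight * silu(z)
    norm_before_gate=False -> rmsnorm(x * silu(z)) * weight
    """
    if not norm_before_gate:
        x = [xi * silu(zi) for xi, zi in zip(x, z)]
    # Root-mean-square normalisation (no mean subtraction).
    mean_sq = sum(xi * xi for xi in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    out = [xi * inv_rms * wi for xi, wi in zip(x, weight)]
    if norm_before_gate:
        out = [oi * silu(zi) for oi, zi in zip(out, z)]
    return out
```

A fused operator performs these steps in one kernel launch; the fallback trades that speed for portability to hardware where the fused kernel is unavailable.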

### Does this PR introduce _any_ user-facing change?
NO
### How was this patch tested?
ut
- vLLM version: v0.17.0
- vLLM main: `4497431df6`

---------

Signed-off-by: Tflowers-0129 <2906339855@qq.com>

2026-03-24 09:00:11 +08:00

Shaoxu Cheng

5b60b530d6

[Bugfix][310p] the new A5 mmencoder op does not support 310p (#7518)

### What this PR does / why we need it?

After the new A5 MMEncoder operator was merged, the 310P could no longer
run any VL models. This PR fixes that issue. Details are in #7046.

### Does this PR introduce _any_ user-facing change?
no
### How was this patch tested?
e2e
- vLLM version: v0.17.0
- vLLM main: `8b6325758c`

---------

Signed-off-by: Tflowers-0129 <2906339855@qq.com>

2026-03-23 15:40:34 +08:00

3 Commits