xc-llm-ascend

Files

jiangmengyu18 85234d096d [v0.18.0][Feature] support qkv_rmsnorm_mrope for qwen3vl (#7852)

### What this PR does / why we need it?
Full attention in Qwen3-VL can now use the split_qkv_rmsnorm_mrope
fusion operator.
### Does this PR introduce _any_ user-facing change?

### How was this patch tested?
- [x] Ran the Qwen3-VL dense model with the fusion operator and verified
that the output is correct
- [x] Ran the Qwen3-VL MoE model with the fusion operator and verified
that the output is correct

---------

Signed-off-by: jiangmengyu18 <451528648@qq.com>
Signed-off-by: jiangmengyu18 <56633611+jiangmengyu18@users.noreply.github.com>
Signed-off-by: betta18 <jiangmengyu1@huawei.com>
Co-authored-by: betta18 <jiangmengyu1@huawei.com>

2026-04-02 17:46:50 +08:00

platform

fix(platform): reimplement MiniMax usage accounting patch (#7835)

2026-03-31 16:27:00 +08:00

worker

[v0.18.0][Feature] support qkv_rmsnorm_mrope for qwen3vl (#7852)

2026-04-02 17:46:50 +08:00

__init__.py

[v0.18.0][Feature] support qkv_rmsnorm_mrope for qwen3vl (#7852)

2026-04-02 17:46:50 +08:00