[Ops] Add layernorm for qwen3Next (#5765)

### What this PR does / why we need it?
Add layernormFn triton op for qwen3Next model for better performance.

<img width="248" height="526" alt="image"
src="https://github.com/user-attachments/assets/27b47157-5df5-4db1-aa88-1dae799b2bf6"
/>

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

- vLLM version: v0.13.0
- vLLM main:
2f4e6548ef

---------

Signed-off-by: SunnyLee219 <3294305115@qq.com>
This commit is contained in:
LeeWenquan
2026-01-20 14:43:14 +08:00
committed by GitHub
parent 0664c6e67a
commit 55b20ac63b
4 changed files with 254 additions and 4 deletions

View File

@@ -54,7 +54,7 @@ jobs:
tests: tests/e2e/nightly/single_node/models/test_qwen3_8b.py
- name: qwen3next
os: linux-aarch64-a2-4
ests: tests/e2e/nightly/single_node/models/test_qwen3_next.py
tests: tests/e2e/nightly/single_node/models/test_qwen3_next.py
- name: qwen3-32b
os: linux-aarch64-a2-4
tests: tests/e2e/nightly/single_node/models/test_qwen3_32b.py