[Ops] Add layernorm for qwen3Next (#5765)
### What this PR does / why we need it?
Add a Triton `layernormFn` op for the qwen3Next model to improve performance.
<img width="248" height="526" alt="image"
src="https://github.com/user-attachments/assets/27b47157-5df5-4db1-aa88-1dae799b2bf6"
/>
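The kernel itself is not shown in this description. As a rough reference for what a layer-norm op computes on each row, and as something a fused kernel's output could be compared against, here is a minimal pure-Python sketch. The name `layer_norm_ref`, the `eps` default, and the omission of any gating are assumptions for illustration; this is not the code the PR adds.

```python
import math


def layer_norm_ref(x, weight, bias, eps=1e-6):
    """Reference layer norm for a single row (hypothetical helper).

    Normalizes the row to zero mean and unit variance, then applies a
    per-element scale (weight) and shift (bias).
    """
    n = len(x)
    mean = sum(x) / n
    # Biased variance (divide by n), as layer normalization conventionally uses.
    var = sum((v - mean) ** 2 for v in x) / n
    rstd = 1.0 / math.sqrt(var + eps)
    return [(v - mean) * rstd * w + b for v, w, b in zip(x, weight, bias)]


row = [1.0, 2.0, 3.0, 4.0]
out = layer_norm_ref(row, weight=[1.0] * 4, bias=[0.0] * 4)
```

With unit weights and zero bias, the output row should have a mean close to 0 and a variance close to 1, which makes it a convenient check when comparing against a faster implementation.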
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
- vLLM version: v0.13.0
- vLLM main: 2f4e6548ef
---------
Signed-off-by: SunnyLee219 <3294305115@qq.com>
--- a/.github/workflows/nightly_test_a2.yaml
+++ b/.github/workflows/nightly_test_a2.yaml
@@ -54,7 +54,7 @@ jobs:
             tests: tests/e2e/nightly/single_node/models/test_qwen3_8b.py
           - name: qwen3next
             os: linux-aarch64-a2-4
-            ests: tests/e2e/nightly/single_node/models/test_qwen3_next.py
+            tests: tests/e2e/nightly/single_node/models/test_qwen3_next.py
           - name: qwen3-32b
             os: linux-aarch64-a2-4
             tests: tests/e2e/nightly/single_node/models/test_qwen3_32b.py
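The hunk above corrects a misspelled key (`ests` to `tests`) in one matrix entry. A typo like this can go unnoticed because the YAML still parses. As a hedged sketch, a small check over the parsed entries could flag it; the required key set and the helper name `find_bad_entries` are assumptions drawn only from the entries visible in this diff, and the entries are written as plain dicts here rather than loaded from the workflow file.

```python
# Keys every matrix entry is assumed to need, based on the entries in this diff.
REQUIRED_KEYS = {"name", "os", "tests"}


def find_bad_entries(entries):
    """Return (entry name, sorted missing keys) for each incomplete entry."""
    problems = []
    for entry in entries:
        missing = REQUIRED_KEYS - set(entry)
        if missing:
            problems.append((entry.get("name", "<unnamed>"), sorted(missing)))
    return problems


# The two entries from the hunk, as they looked before the fix.
entries = [
    {
        "name": "qwen3next",
        "os": "linux-aarch64-a2-4",
        "ests": "tests/e2e/nightly/single_node/models/test_qwen3_next.py",
    },
    {
        "name": "qwen3-32b",
        "os": "linux-aarch64-a2-4",
        "tests": "tests/e2e/nightly/single_node/models/test_qwen3_32b.py",
    },
]

problems = find_bad_entries(entries)
```

Run against the pre-fix entries, `problems` lists `qwen3next` as missing `tests`; after the fix it would be empty.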