aipaes
1c0ecf806a
[bugfix] fix pass bug: pass really rope dim for npu_rotary_embedding (#6880)
### What this PR does / why we need it?
pass really rope dim for npu_rotary_embedding
**before:**
q_rope, k_rope = torch.ops.vllm.npu_rotary_embedding(
positions, q_flat, k_flat, cos_sin_cache, self.head_dim,
**self.head_dim,** True
)
**after:**
q_rope, k_rope = torch.ops.vllm.npu_rotary_embedding(
positions, q_flat, k_flat, cos_sin_cache, self.head_dim,
**self.rope_dim,** True
)
### Does this PR introduce _any_ user-facing change?
no
### How was this patch tested?
- vLLM version: v0.16.0
- vLLM main:
15d76f74e2
---------
Signed-off-by: zjks98 <zhangjiakang4@huawei.com>
Signed-off-by: aipaes <82140963+aipaes@users.noreply.github.com>
Co-authored-by: zjks98 <zhangjiakang4@huawei.com>
2026-03-06 19:35:17 +08:00
..
2026-03-03 15:57:26 +08:00
2025-11-28 18:06:39 +08:00
2026-03-06 17:11:22 +08:00
2026-03-06 19:35:17 +08:00
2026-03-06 10:01:33 +08:00
2026-03-02 18:17:01 +08:00
2026-01-19 08:59:46 +08:00
2026-03-05 21:04:11 +08:00
2026-03-06 09:53:29 +08:00
2026-02-27 16:05:21 +08:00
2026-01-24 22:45:38 +08:00
2026-01-24 22:08:33 +08:00
2026-03-06 17:14:08 +08:00
2026-03-06 17:14:08 +08:00
2026-03-06 17:24:22 +08:00
2026-03-05 09:12:40 +08:00
2026-03-06 17:11:22 +08:00
2026-03-06 17:14:08 +08:00
2026-02-06 15:35:06 +08:00
2026-01-16 20:57:46 +08:00
2026-03-05 09:12:40 +08:00
2026-03-02 11:04:06 +08:00
2026-03-05 09:12:40 +08:00
2026-03-03 17:20:52 +08:00
2026-02-09 15:38:58 +08:00
2026-01-16 20:57:46 +08:00
2026-02-07 09:24:05 +08:00
2026-03-02 17:54:25 +08:00
2026-02-01 20:06:01 +08:00
2026-03-06 19:35:17 +08:00