Files
xc-llm-ascend/vllm_ascend
ChrisGelhLan 5ebb9bd8d2 【Bugfix】bugfix_for_bmm_transpose (#4899)
The bmm_transpose operator in version 3.2 is only used in the decoding stage due to shape limitations.

- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c

---------

Signed-off-by: ChrisGelhLan <33011886+xlan-huawei@users.noreply.github.com>
2025-12-11 16:32:28 +08:00
..
2025-12-05 09:03:45 +08:00
2025-12-02 22:10:52 +08:00
2025-11-24 17:08:20 +08:00
2025-12-02 17:35:47 +08:00