xc-llm-ascend

Files

lhp-deep b230e7e987 [MOE] move weight transpose to wakeup for RL scenarios (#4626)

### What this PR does / why we need it?
In reinforcement learning (RL) scenarios, the inference path currently
applies a transpose operation to the weights. To keep the architecture
cleaner, this PR moves the weight transpose to wakeup.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.12.0
- vLLM main: ad32e3e19c

Signed-off-by: lhp-deep <liuhaopeng1@huawei.com>
Co-authored-by: weijinqian0 <1184188277@qq.com>

2025-12-08 20:34:52 +08:00

fused_moe

[MOE] move weight transpose to wakeup for RL scenarios (#4626)

2025-12-08 20:34:52 +08:00

triton

[Model] Add qwen3Next support in Main (#4596)

2025-12-03 14:17:37 +08:00

__init__.py

[Refactor] [MoE] Rename moe-related classes & files (#3646)

2025-10-25 11:22:03 +08:00

activation.py

[refact] unified soc_version code (#4359)