xc-llm-ascend

Files

weijinqian0 2b3bfe432e [bugfix] Repair the problem of moe model accuracy caused by version upgrade. (#4562 )

Repair the problem of moe model accuracy caused by version upgrade.

Reason:
The new version adds the "reduce_output" operation after "forward_impl".

Then we have fully taken over the implementation of the FusedMoe module.


- vLLM version: v0.11.2
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2

---------

Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
Co-authored-by: weijinqian_v1 <weijinqian@huawei.com>

2025-11-30 06:12:39 +08:00

fused_moe

[bugfix] Repair the problem of moe model accuracy caused by version upgrade. (#4562 )

2025-11-30 06:12:39 +08:00

triton

【OPS】qwen3-next support triton chunk_gated_delta_rule ops (#4070 )

2025-11-28 20:55:43 +08:00

__init__.py

[Refactor] [MoE] Rename moe-related classes & files (#3646 )

2025-10-25 11:22:03 +08:00

activation.py

[refact] unified soc_version code (#4359 )