xc-llm-ascend

Files

weichen 3a5fc5ee01 [Refactor][MoE] remove redundant code after refactoring fused_moe (#2612)

### What this PR does / why we need it?
The MoE code contains a lot of redundancy, and its structure is not very
clear. This PR makes the following changes:

- Moves the relatively independent code related to apply_mlp into a
  separate file.
- Removes the alltoall_buffer and alltoall_seq environment variables.
- Removes the code related to alltoall_buffer and alltoall_seq, keeping
  only the TokenDispatcher-derived class.

### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
E2E tests and unit tests (UT).

- vLLM version: v0.10.1.1
- vLLM main: 4071c76cf3

---------

Signed-off-by: Pr0Wh1teGivee <calvin_zhu0210@outlook.com>
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
Co-authored-by: weijinqian0 <12153182+weijinqian0@users.noreply.github.com>

2025-08-30 22:28:50 +08:00

matchers

[Core] Init vllm-ascend (#3)

2025-02-05 10:53:12 +08:00

accuracy_test.yaml

[CI] Upgrade vllm in accuracy and performance CI (#2527)

2025-08-26 08:49:49 +08:00

format_pr_body.yaml

[CI] fix ci (#2464)

2025-08-22 07:30:48 +08:00

image_310p_openeuler.yml

[CI] fix ci (#2464)