xc-llm-ascend

Files

huangxialu 9c9a7cd90b [main] adapt usage of npu_moe_gating_top_k_softmax and remove envs.SELECT_GATING_TOPK_SOTFMAX_EXPERTS (#2112)

backport of v0.9.1-dev:
https://github.com/vllm-project/vllm-ascend/pull/1902

original main-branch PR for npu_moe_gating_top_k_softmax:
https://github.com/vllm-project/vllm-ascend/pull/1355

- vLLM version: v0.10.0
- vLLM main: 055bd3978e

Signed-off-by: huangxialu <huangxialu1@huawei.com>

2025-07-31 21:05:56 +08:00

__init__.py

[CI] Add unit test framework (#1201)

2025-06-16 18:32:28 +08:00

test_bgmv_expand.py

Add Custom Kernels For LoRA Performance (#1884)

2025-07-29 19:27:50 +08:00

test_bgmv_shrink.py

Add Custom Kernels For LoRA Performance (#1884)

2025-07-29 19:27:50 +08:00

test_fused_moe.py

[main] adapt usage of npu_moe_gating_top_k_softmax and remove envs.SELECT_GATING_TOPK_SOTFMAX_EXPERTS (#2112)