EngineX/xc-llm-ascend
Commit: 418a43e2a2f853282c4337ebac2455cd8f316b6f
Path: xc-llm-ascend/vllm_ascend/ops/fused_moe
Latest commit: 819a4459ce by zhangxinyuehfad (2026-01-23 09:45:08 +08:00)

Drop vLLM 0.13.0 support (#6069)

### What this PR does / why we need it?
Drop vLLM 0.13.0 support and upgrade to v0.14.0.

- vLLM version: v0.13.0
- vLLM main: d68209402d

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
File                  Last commit                                                                                 Date
__init__.py           [Refactor] [MoE] Rename moe-related classes & files (#3646)                                2025-10-25 11:22:03 +08:00
comm_utils.py         [Refactor] [MoE] Rename moe-related classes & files (#3646)                                2025-10-25 11:22:03 +08:00
experts_selector.py   [Kernel] Add moe_gating_top_k operator support for Ascend NPU (#5579)                      2026-01-07 21:42:31 +08:00
fused_moe.py          Drop vLLM 0.13.0 support (#6069)                                                           2026-01-23 09:45:08 +08:00
moe_comm_method.py    [BugFix] Fix input parameter bug of dispatch_gmm_combine_decode[RFC: issue 5476] (#5932)   2026-01-21 09:26:40 +08:00
moe_mlp.py            [refactor] Remove unnecessary attributes from set_ascend_forward_context (#5204)           2025-12-23 08:49:52 +08:00
prepare_finalize.py   [Feature] Support fine-grained shared expert overlap (#5482)                               2026-01-17 11:53:22 +08:00
token_dispatcher.py   [BugFix] Fix input parameter bug of dispatch_gmm_combine_decode[RFC: issue 5476] (#5932)   2026-01-21 09:26:40 +08:00