xc-llm-ascend

Files

wangxiyuan 830332ebfc Clean up v0.9.1 code (#1672 )

vllm has released 0.9.2. This PR drop 0.9.1 support.

- vLLM version: v0.9.1
- vLLM main:
b942c094e3

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>

2025-07-09 08:52:24 +08:00

__init__.py

[aclgraph] implentment NPUPiecewiseBackend to enable aclgraph (#836 )

2025-05-29 11:58:26 +08:00

activation.py

[Platform] Add initial experimental support for Altlas 300I series (#1333 )

2025-06-21 09:00:16 +08:00

attention.py

[1/N][UT][v1 MTP] add basic v1 mtp features (#890 )

2025-05-30 08:59:58 +08:00

cache.py

port deepseekv2 and mtp to main branch (#429 )

2025-04-19 17:38:18 +08:00

common_fused_moe.py

[Bugfix] Support Qwen3-MOE on aclgraph mode (#1381 )

2025-07-06 15:29:36 +08:00

expert_load_balancer.py

Add static EPLB (#1116 )

2025-06-09 19:28:11 +08:00

fused_moe.py

Clean up v0.9.1 code (#1672 )

2025-07-09 08:52:24 +08:00

layernorm.py

[Platform] Add initial experimental support for Altlas 300I series (#1333 )

2025-06-21 09:00:16 +08:00

rotary_embedding.py

[CORE]initial support for torchair with non-mla backend (#1506 )

2025-07-03 22:21:42 +08:00

vocab_parallel_embedding.py

port deepseekv2 and mtp to main branch (#429 )

2025-04-19 17:38:18 +08:00