EngineX/xc-llm-ascend
e4e0b7af05a2b0faa72884c4bd5a628d35e0ba41
xc-llm-ascend/vllm_ascend/ops
lyj-jjj 5177bef87a support fused_moe_allgather_ep (#1335)
### What this PR does / why we need it?
support fused_moe_allgather_ep

### How was this patch tested?
It was tested by UT.

Signed-off-by: lyj-jjj <liuyingjun5@huawei.com>
2025-06-23 22:03:38 +08:00
| File | Last commit | Date |
| --- | --- | --- |
| __init__.py | [aclgraph] implentment NPUPiecewiseBackend to enable aclgraph (#836) | 2025-05-29 11:58:26 +08:00 |
| activation.py | [Platform] Add initial experimental support for Altlas 300I series (#1333) | 2025-06-21 09:00:16 +08:00 |
| attention.py | [1/N][UT][v1 MTP] add basic v1 mtp features (#890) | 2025-05-30 08:59:58 +08:00 |
| cache.py | port deepseekv2 and mtp to main branch (#429) | 2025-04-19 17:38:18 +08:00 |
| common_fused_moe.py | [Platform] Add initial experimental support for Altlas 300I series (#1333) | 2025-06-21 09:00:16 +08:00 |
| expert_load_balancer.py | Add static EPLB (#1116) | 2025-06-09 19:28:11 +08:00 |
| fused_moe.py | support fused_moe_allgather_ep (#1335) | 2025-06-23 22:03:38 +08:00 |
| layernorm.py | [Platform] Add initial experimental support for Altlas 300I series (#1333) | 2025-06-21 09:00:16 +08:00 |
| rotary_embedding.py | [Platform] Add initial experimental support for Altlas 300I series (#1333) | 2025-06-21 09:00:16 +08:00 |
| vocab_parallel_embedding.py | port deepseekv2 and mtp to main branch (#429) | 2025-04-19 17:38:18 +08:00 |