Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Projects Releases Wiki Activity
Files
890701081582dcce85d2b699c8a9d5ef0eefa638
xc-llm-ascend/vllm_ascend/ops
History
wangxiyuan 6360eb1dea Revert "[Bugfix] Fix Qwen2.5-Omni-7B accuarcy test (#4556)" (#4619)
This reverts commit 71e9b379c8. It breaks vllm-ascend/Qwen3-30B-A3B-W8A8 test
2025-12-02 13:15:47 +08:00
..
fused_moe
[Bugfix]Fix eplb enable when using mtp float weights. (#4571)
2025-12-02 09:20:49 +08:00
triton
【OPS】qwen3-next support triton chunk_gated_delta_rule ops (#4070)
2025-11-28 20:55:43 +08:00
__init__.py
…
activation.py
[refact] unified soc_version code (#4359)
2025-11-26 14:28:55 +08:00
attention.py
…
expert_load_balancer.py
eplb redundant expert bugfix (#4291)
2025-11-21 14:24:35 +08:00
layernorm.py
Revert "[Bugfix] Fix Qwen2.5-Omni-7B accuarcy test (#4556)" (#4619)
2025-12-02 13:15:47 +08:00
linear_op.py
[Feat] flashcomm_v2 optim solution (#3232)
2025-11-10 11:01:45 +08:00
linear.py
[Bugfix] Remove ModelSlim-"M4 Quantization". (#4589)
2025-12-01 23:45:02 +08:00
mla.py
Move mla to ops module (#4575)
2025-11-29 18:36:55 +08:00
register_custom_ops.py
Revert "[Bugfix] Fix Qwen2.5-Omni-7B accuarcy test (#4556)" (#4619)
2025-12-02 13:15:47 +08:00
rotary_embedding.py
[refact] unified soc_version code (#4359)
2025-11-26 14:28:55 +08:00
vocab_parallel_embedding.py
…
weight_prefetch.py
Update torch-npu version to 2.7.1 (#3896)
2025-10-31 17:16:31 +08:00
Powered by Gitea Version: 1.24.3 Page: 3763ms Template: 47ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API