Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Actions Projects Releases Wiki Activity
Files
eec60681878bc62ad971ce79b86152c4234c5bf7
xc-llm-ascend/vllm_ascend/ops
History
zzzzwwjj f1543d5e0d [bugfix] fix deeepseek accuracy (#1118)
### What this PR does / why we need it?
fix deeepseek accuracy in mix-parallel case.


Signed-off-by: zzzzwwjj <1183291235@qq.com>
2025-06-07 21:11:36 +08:00
..
__init__.py
[aclgraph] implentment NPUPiecewiseBackend to enable aclgraph (#836)
2025-05-29 11:58:26 +08:00
activation.py
Optimize qwen2_vl and qwen2_5_vl (#701)
2025-04-30 14:22:38 +08:00
attention.py
[1/N][UT][v1 MTP] add basic v1 mtp features (#890)
2025-05-30 08:59:58 +08:00
cache.py
port deepseekv2 and mtp to main branch (#429)
2025-04-19 17:38:18 +08:00
common_fused_moe.py
[Attention][Kernel]moe support for llama4 and mllama4 (#740)
2025-05-13 19:12:40 +08:00
fused_moe.py
[bugfix] fix deeepseek accuracy (#1118)
2025-06-07 21:11:36 +08:00
layernorm.py
[CI]Add model basic accuracy test(Qwen2.5-0.5B-Instruct) (#460)
2025-04-17 14:59:56 +08:00
rotary_embedding.py
[Bugfix] Correct method call for _set_cos_sin_cache (#774)
2025-05-09 12:55:57 +08:00
vocab_parallel_embedding.py
port deepseekv2 and mtp to main branch (#429)
2025-04-19 17:38:18 +08:00
Powered by Gitea Version: 1.24.3 Page: 161ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API