EngineX/xc-llm-ascend
f1543d5e0d71de4fc4a649c1d20d2d2ac2eb5d7e
xc-llm-ascend/vllm_ascend/ops
zzzzwwjj f1543d5e0d [bugfix] fix deeepseek accuracy (#1118)
### What this PR does / why we need it?
fix deeepseek accuracy in mix-parallel case.


Signed-off-by: zzzzwwjj <1183291235@qq.com>
2025-06-07 21:11:36 +08:00
| File | Last commit | Date |
| --- | --- | --- |
| `__init__.py` | [aclgraph] implentment NPUPiecewiseBackend to enable aclgraph (#836) | 2025-05-29 11:58:26 +08:00 |
| `activation.py` | Optimize qwen2_vl and qwen2_5_vl (#701) | 2025-04-30 14:22:38 +08:00 |
| `attention.py` | [1/N][UT][v1 MTP] add basic v1 mtp features (#890) | 2025-05-30 08:59:58 +08:00 |
| `cache.py` | port deepseekv2 and mtp to main branch (#429) | 2025-04-19 17:38:18 +08:00 |
| `common_fused_moe.py` | [Attention][Kernel]moe support for llama4 and mllama4 (#740) | 2025-05-13 19:12:40 +08:00 |
| `fused_moe.py` | [bugfix] fix deeepseek accuracy (#1118) | 2025-06-07 21:11:36 +08:00 |
| `layernorm.py` | [CI]Add model basic accuracy test(Qwen2.5-0.5B-Instruct) (#460) | 2025-04-17 14:59:56 +08:00 |
| `rotary_embedding.py` | [Bugfix] Correct method call for _set_cos_sin_cache (#774) | 2025-05-09 12:55:57 +08:00 |
| `vocab_parallel_embedding.py` | port deepseekv2 and mtp to main branch (#429) | 2025-04-19 17:38:18 +08:00 |