Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Actions Projects Releases Wiki Activity
Files
4976b48b98f7268a68fe055265f928afd779ccc4
xc-llm-ascend/vllm_ascend/ops
History
zzzzwwjj f1543d5e0d [bugfix] fix deeepseek accuracy (#1118)
### What this PR does / why we need it?
fix deeepseek accuracy in mix-parallel case.


Signed-off-by: zzzzwwjj <1183291235@qq.com>
2025-06-07 21:11:36 +08:00
..
__init__.py
[aclgraph] implentment NPUPiecewiseBackend to enable aclgraph (#836)
2025-05-29 11:58:26 +08:00
activation.py
Optimize qwen2_vl and qwen2_5_vl (#701)
2025-04-30 14:22:38 +08:00
attention.py
[1/N][UT][v1 MTP] add basic v1 mtp features (#890)
2025-05-30 08:59:58 +08:00
cache.py
port deepseekv2 and mtp to main branch (#429)
2025-04-19 17:38:18 +08:00
common_fused_moe.py
[Attention][Kernel]moe support for llama4 and mllama4 (#740)
2025-05-13 19:12:40 +08:00
fused_moe.py
[bugfix] fix deeepseek accuracy (#1118)
2025-06-07 21:11:36 +08:00
layernorm.py
[CI]Add model basic accuracy test(Qwen2.5-0.5B-Instruct) (#460)
2025-04-17 14:59:56 +08:00
rotary_embedding.py
[Bugfix] Correct method call for _set_cos_sin_cache (#774)
2025-05-09 12:55:57 +08:00
vocab_parallel_embedding.py
port deepseekv2 and mtp to main branch (#429)
2025-04-19 17:38:18 +08:00
Powered by Gitea Version: 1.24.3 Page: 280ms Template: 74ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API