Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Actions Projects Releases Wiki Activity
Files
217211d8a3cdaed88d741b439dcb98e1d62300d7
xc-llm-ascend/vllm_ascend/ops
History
rjg-lyh c6ac399091 [Bugfix] Fix the method of importing environment variables in DeepSee… (#817)
### What this PR does / why we need it?
Fix the method of importing environment variables in DeepSeek model to
support successful compilation via aclgraph.

Signed-off-by: rjg-lyh <1318825571@qq.com>
2025-05-13 12:52:30 +08:00
..
__init__.py
[MISC] Clean up torch_npu (#688)
2025-04-29 18:03:38 +08:00
activation.py
Optimize qwen2_vl and qwen2_5_vl (#701)
2025-04-30 14:22:38 +08:00
attention.py
[Bugfix] Fix output tensor shape in vanilla_chunked_prefill and update import paths for model_loader (#773)
2025-05-08 14:19:26 +08:00
cache.py
port deepseekv2 and mtp to main branch (#429)
2025-04-19 17:38:18 +08:00
common_fused_moe.py
[Model] Support common fused moe ops for moe model, such as Qwen3Moe (#709)
2025-04-28 21:57:01 +08:00
fused_moe.py
[Bugfix] Fix the method of importing environment variables in DeepSee… (#817)
2025-05-13 12:52:30 +08:00
layernorm.py
[CI]Add model basic accuracy test(Qwen2.5-0.5B-Instruct) (#460)
2025-04-17 14:59:56 +08:00
rotary_embedding.py
[Bugfix] Correct method call for _set_cos_sin_cache (#774)
2025-05-09 12:55:57 +08:00
vocab_parallel_embedding.py
port deepseekv2 and mtp to main branch (#429)
2025-04-19 17:38:18 +08:00
Powered by Gitea Version: 1.24.3 Page: 97ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API