EngineX/xc-llm-ascend
a58b43b72cdae699e8a2ad239a36e41eb97577d6
xc-llm-ascend/vllm_ascend/models
liziyu 4c90fa79ca [Misc] Remove useless PD check in deepseek (#2739)
### What this PR does / why we need it?
Remove useless PD check in deepseek


- vLLM version: v0.10.1.1
- vLLM main: 6c7af8110a

---------

Signed-off-by: liziyu <liziyu16@huawei.com>
2025-09-04 22:22:19 +08:00
__init__.py
[Misc] Remove redundant imported envs, using envs_ascend instead (#2193)
2025-08-14 09:33:39 +08:00
deepseek_dbo.py
[MISC] Cherry pick #1291 from v0.9.1-dev (#1825)
2025-08-01 09:08:45 +08:00
deepseek_mtp.py
feat: add mtp ut and fix some bugs (#2453)
2025-08-22 17:09:08 +08:00
deepseek_v2.py
[Misc] Remove useless PD check in deepseek (#2739)
2025-09-04 22:22:19 +08:00
deepseek_v3.py
Move deepseek_v3 from deepseek_v2.py (#1793)
2025-07-19 11:37:03 +08:00
pangu_moe.py
[1/N][Draft][Refactor]torchair pangu_moe modeling refactor (#2437)
2025-09-04 10:39:21 +08:00
qwen2_5_vl_without_padding.py
[Bugfix] Fix qwen2.5-vl-without-padding (#2623)
2025-09-03 14:38:55 +08:00
qwen2_5_vl.py
[Quickfix] update CachedRequestState as NewRequestData changed (#2367)
2025-08-15 07:35:27 +08:00
qwen2_vl.py
[Bug] Fix duplicate 'torch.' prefix in qwen-vl (#1986)
2025-07-24 20:16:00 +08:00
qwen3_moe.py
Support v0.10.1 (#2584)
2025-08-28 18:47:53 +08:00
qwen3.py
[main] Use AddRmsNormQuant ops in the custom model to optimize Qwen3's performance (#1806)
2025-07-22 19:03:13 +08:00