Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Actions Projects Releases Wiki Activity
Files
be9e3e85457381fc537bca6c7ad4cb97ad39d32d
xc-llm-ascend/vllm_ascend
History
Mengqing Cao be9e3e8545 [Bugfix] Fix triton placeholder patch period (#704)
Fix triton placeholder patch period

Signed-off-by: MengqingCao <cmq0113@163.com>
2025-04-28 18:52:03 +08:00
..
attention
[Fix] fix deepseek v0 attention eager mode (#671)
2025-04-28 08:53:06 +08:00
core
[BUGFIX] main-sd-bugfix && [UT] add mtp UT (#593)
2025-04-21 19:25:51 +08:00
device_allocator
catch ImportError when C code not compiled (#575)
2025-04-18 18:11:49 +08:00
distributed
[BUGFIX] main-sd-bugfix && [UT] add mtp UT (#593)
2025-04-21 19:25:51 +08:00
lora
[Bugfix] fix import error (#600)
2025-04-22 08:57:25 +08:00
models
[BUILD] Upgrade torch-npu to 2.5.1 (#661)
2025-04-27 17:28:29 +08:00
ops
support aclgraph (#426)
2025-04-23 20:56:24 +08:00
patch
[Bugfix] Fix triton placeholder patch period (#704)
2025-04-28 18:52:03 +08:00
quantization
support deepseek quant & mix-parallel with graphmode (#585)
2025-04-23 16:23:25 +08:00
worker
Remove prompt string from engine core data structures (#663)
2025-04-26 23:15:58 +08:00
__init__.py
[Bugfix] Fix triton placeholder patch period (#704)
2025-04-28 18:52:03 +08:00
envs.py
[MISC] Make vllm version configurable (#651)
2025-04-28 14:19:06 +08:00
platform.py
[V1] Make V1 engine backward compatible (#637)
2025-04-24 17:20:11 +08:00
utils.py
[MISC] Make vllm version configurable (#651)
2025-04-28 14:19:06 +08:00
Powered by Gitea Version: 1.24.3 Page: 92ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API