Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Projects Releases Wiki Activity
Files
1e05c4908f31737bc4eef865a9f351d030a77c9d
xc-llm-ascend/vllm_ascend/_310p
History
Li Wang 83a4065b4b [CI] Add pre-commit check for patch logger (#7446)
### What this PR does / why we need it?
See https://github.com/vllm-project/vllm-ascend/pull/7402, pre-commit
hook will forbid init_logger(__name__) in vllm_ascend patch modules

- vLLM version: v0.17.0
- vLLM main:
8a680463fa

---------

Signed-off-by: wangli <wangli858794774@gmail.com>
2026-03-19 16:53:20 +08:00
..
attention
[Refactor] [310p] Support Mamba Cache and support attn_head_size larger than 128 (#7372)
2026-03-19 09:16:22 +08:00
fused_moe
[Version] Drop 0.16.0 support (#7153)
2026-03-13 16:14:15 +08:00
ops
[300I][Bugfix] fix unquant model weight nd2nz error (#6851)
2026-03-03 15:57:26 +08:00
quantization
[CI] Add pre-commit check for patch logger (#7446)
2026-03-19 16:53:20 +08:00
__init__.py
[Feature]: Support 310P device run qwen2.5/3 dense and qwen2.5vl models (#5776)
2026-01-17 11:49:18 +08:00
model_runner_310p.py
[Refactor] [310p] Support Mamba Cache and support attn_head_size larger than 128 (#7372)
2026-03-19 09:16:22 +08:00
sharded_state_loader_310p.py
[Feat][310p] 310P support w8a8s quantization and saving w8a8sc state (#6878)
2026-03-02 20:09:15 +08:00
worker_310p.py
[Refactor] [310p] Support Mamba Cache and support attn_head_size larger than 128 (#7372)
2026-03-19 09:16:22 +08:00
Powered by Gitea Version: 1.24.3 Page: 316ms Template: 7ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API