EngineX/xc-llm-ascend
e4d898b245de57516260ac546f8c92d126831b55
xc-llm-ascend/vllm_ascend/worker
starkwj  e4d898b245  adapt to vllm-ascend v0.18.0rc1  2026-04-21 03:05:32 +00:00
Some checks failed. Merge Conflict Labeler / main (push): Has been cancelled
v2                  [v0.18.0][CI] Fix releases/v0.18.0 ci test only support vllm v0.18.0 (#7686)  2026-03-26 18:36:04 +08:00
__init__.py         [Misc][V0 Deprecation] Remove Cache Engine Used for V0 Worker (#1878)  2025-07-19 09:42:32 +08:00
block_table.py      [Hybrid] support prefix cache for Qwen3.5/Next with --mamba-cache-mode align (#7103)  2026-03-15 09:44:09 +08:00
model_runner_v1.py  [Bugfix]v0.18.0 support FlashComm1 & DCP for Qwen (#7726)  2026-03-29 15:59:19 +08:00
npu_input_batch.py  [Hybrid] support prefix cache for Qwen3.5/Next with --mamba-cache-mode align (#7103)  2026-03-15 09:44:09 +08:00
pcp_utils.py        feat(attention_cp): support chunked prefill for Qwen3Next with PCP&DCP (#6900)  2026-03-09 17:55:09 +08:00
worker.py           adapt to vllm-ascend v0.18.0rc1  2026-04-21 03:05:32 +00:00